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Abstract. We obtain sharp estimates on the growth rate of stable commu- 
tator length on random (geodesic) words, and on random walks, in hyperbolic 
groups and groups acting nondegenerately on hyperbolic spaces. In either case, 
we show that with high probability stable commutator length of an element of 
length n is of order n/logn. 

This establishes quantitative refinements of qualitative results of Bestvina- 
Fujiwara and others on the infinite dimensionality of 2-dimensional bounded 
cohomology in groups acting suitably on hyperbolic spaces, in the sense that 
we can control the geometry of the unit balls in these normed vector spaces 
(or rather, in random subspaces of their normed duals). 

As a corollary of our methods, we show that an clement obtained by random 
walk of length n in a mapping class group cannot be written as a product of 
fewer than 0(n/ log n) reducible elements, with probability going to 1 as n 
goes to infinity. We also show that the translation length on the complex of 
free factors of a random walk of length n on the outer automorphism group of 
a free group grows linearly in n. 
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1. Introduction 

The goal of this paper is to use probabilistic and ergodic theoretic methods to 
obtain bounds and estimates for the growth rate of characteristic norms in certain 
classes of groups, especially stable commutator length in hyperbolic groups (and some 
groups acting on hyperbolic spaces). These growth rates are a quantitative obstruc- 
tion to the existence of nontrivial homomorphisms between groups, which refine 
known qualitative obstructions due to Bestvina-Fujiwara [4] and others. They also 
illustrate a probabilistic connection between the growth rate of geometric and alge- 
braic quantities, which are a manifestation of the phenomenon of Mostow Rigidity 
in a broad context. 

1.1. Concentration and hyperbolicity. Let £„ be a sequence of random real 
variables. For many natural sequences £„ one can prove a law of large numbers, i.e. 
the almost sure existence of a limit L := lim„_ i . 00 £ n /n (this is colloquially known 
as convergence to the mean). Such laws of large numbers hold under remarkably 
broad hypotheses. 

It is natural to look for examples in geometric group theory, and it is not hard 
to find them. If G is a group and £ : G — » R + any subadditive function, one 
can take £„ to be the value of £ on some Markov process on G. For any group 
G, random walk is the most obvious and important example of a natural Markov 
process, but for specific classes of groups, other natural Markov processes exist, 
arising (for example) from an automatic structure on G. The law of large numbers 
then follows from an application of Kingman's subadditive ergodic theorem. 

For example, when G is hyperbolic, Cannon [13] showed how to construct a 
stationary finite Markov chain which gives a geodesic combing of G. And when 
G is the fundamental group of a closed, negatively curved manifold, Ratner [41] 
showed how to build a Markov coding for the geodesic flow, whose closed orbits 
are naturally in bijection with conjugacy classes in G. Roughly speaking, the 
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geometric hyperbolicity implies dynamical hyperbolicity for the geodesic flow, and 
this dynamical hyperbolicity gives rise to laws of large numbers. 

It is rarer to find examples where £„ has sublinear growth, but nevertheless 
concentrates at some deterministic scale. Colloquially, we say that a sequence 
of non-negative random variables £ n is concentrated if there is some deterministic 
function f(n) so that £ n / f{n) converges in probability to a Dirac mass at 1, and 
we say that £„ is compressed if (again for some f(n j) every weak limit of £ n //(n) 
is a probability measure on R+ with support bounded away from and oo. 

1.2. Bounded cohomology. Bounded cohomology, as introduced by Gromov [25], 
is (among other things) a functor from the category of groups and homomorphisms 
to the category of normed vector spaces and norm-decreasing linear maps. One 
of the main virtues of this functor is its monotonicity: if the invariants associ- 
ated to a group G are "smaller" than the invariants associated to a group H, 
there are no interesting homomorphisms from G to H. As a well-known example, 
Bestvina-Fujiwara [4] used 2-dimensional bounded cohomology to show that every 
homomorphism from a higher rank lattice to a mapping class group factors through 
a finite group (this fact was known earlier by work of Farb-Masur [19], building on 
work of Kaimanovich-Masur [33] ) . 

To this date, such tools have generally been used somewhat crudely, because 
of the enormous difficulty in computing bounded cohomology, or deriving useful 
invariants from it. Most authors have concentrated on bounded cohomology in 
dimension 2, and have focused almost exclusively on a trichotomous distinction: 
namely for a given group G, whether H 2 {G) (i.e. 2-dimensional bounded cohomol- 
ogy with real coefficients) is trivial, nontrivial but finite dimensional, or infinite 
dimensional. 

In a way, this misses the main point, which is that H 2 {G) is canonically a Banach 
space. Almost nothing is known about the (large scale) geometry of this Banach 
space in any nontrivial cases. In this paper, we are able to derive strong geometric 
information about the geometry of these Banach spaces in the important case of 
hyperbolic groups (and some groups acting suitably on hyperbolic spaces) . This is 
done via the relationship between 2-dimensional bounded cohomology, quasimor- 
phisms, and stable commutator length. 

For any group G there is an exact sequence of real vector spaces 

fl^G) -> Q(G) -> H%(G) -> H 2 (G) 

where Q denotes the space of homogeneous quasimorphisms on G (sec § 2 for a 
precise definition, and e.g. [10] Thm. 2.50 for a proof). For a finitely presented 
group, H 1 and H 2 are finite dimensional, so H 2 and Q carry (almost) the same 
information. Moreover, H 2 and Q/H 1 are Banach spaces in a functorial way, and 
the map Q/H 1 — » H 2 is 2-bilipschitz. We focus on Q/H 1 in this paper, although 
it would be straightforward to reinterpret our results in terms of H 2 . 

Bavard [2] interpreted Q/H 1 in terms of an a priori algebraic invariant called 
stable commutator length (hereafter scl). If G is a group and [G, G] denotes its 
commutator subgroup, the commutator length of any g <E [G, G] (denoted c\(g)) is 
the least number of commutators in G whose product is equal to g, and the stable 
commutator length of g is the limit 



scl(g) := lim c\(g n )/n 

n— 7-oc 
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In fact, scl extends in a natural way to a (pseudo)-norm on the vector space B\(G) 
of real group 1-boundaries, and descends to the quotient Bp (G) := B\(G)/{g n — 
ng,g — hgh^ 1 ). In many important cases, scl is a norm on Bp . In every case it 
turns out that Q/H 1 is the dual of Bp with its scl (pseudo)-norm, where Q/H 1 
carries the so-called defect norm 2D(-) (see § 2). This statement is usually known 
as Generalized Bavard Duality, and in this generality is proved in [10]. 

1.3. stable commutator length as a random variable. In this paper we study 
the growth rate of scl and the geometry of Bp on random elements and in random 
subspaces of hyperbolic (and other) groups, with respect to two natural families 
of probability measures. First, we obtain results for random geodesies of length n 
in hyperbolic groups. Secondly, we obtain results for random walks of length n in 
hyperbolic groups, and in groups acting suitably on hyperbolic spaces. These two 
senses of random are conceptually related, but in general become mutually singular 
as n — > oo. 

There are technical subtleties in either case which are somewhat complementary. 
The theory of stable commutator length is well-adapted to the geometry of quasi- 
geodesics in hyperbolic groups, since quasi- geodesity can be certified — and stable 
commutator length estimated — from local contributions. A combing determines a 
geodesic representative of each element in a group, and the Markov process associ- 
ated to an automatic structure allows one to pick a random element of prescribed 
word length. But there is no guarantee that the Markov process in question is 
ergodic. Complcmcntarily, the Markov process defining a random walk is always 
ergodic, but random paths in hyperbolic spaces are typically not quasi-geodesic. 
Nevertheless, in either context we are able to obtain compression results for scl, at 
the deterministic growth rate of n/logn. 

1.4. The geometry of Q as an obstruction. If G — > H is a surjective homo- 
morphism, the pullback Q(H) — > Q{G) is injective. Hence if Q{G) vanishes but 
Q(H) does not, no such surjective homomorphism exists. More generally, no such 
surjective homomorphism exists if Q{G) is finite dimensional but Q(H) is infinite 
dimensional. 

In their seminal paper [4], Bestvina-Fujiwara showed that if if is a nonelemen- 
tary subgroup of a mapping class group, then Q(H) is infinite dimensional. On 
the other hand, Q(T) is finite dimensional whenever T is a lattice in a higher rank 
Lie group; consequently every homomorphism from such a lattice to a mapping 
class group factors through an elementary subgroup, and therefore (by Margulis) 
through a finite group. Independently Hamenstadt [29] and Bestvina-Feighn [3] 
proved a similar theorem for subgroups of Out(F„) (Hamenstadt 's construction 
considers bounded cohomology with twisted coefficients). 

This application of bounded cohomology makes use of Q as an R-vector space, 
but completely neglects its (Banach) geometry. Any homomorphism G — > H is 
nonincreasing for scl. A random walk on G pushes forward under a homomorphism 
to a random walk on H; the growth rate of scl under random walks is therefore 
an obstruction to the existence of such a homomorphism. To apply these ideas in 
practice one must be able to understand the growth rate of scl under random walks 
in the target group with respect to a measure whose support can be arbitrary; when 
H is the target group one must be prepared to consider the case that the support 
of the measure generates a subgroup which is not quasiconvex. Carrying this out 
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for hyperbolic groups, and groups acting in a suitable way on hyperbolic spaces, is 
one of the main goals of this paper. 

1.5. Random dynamical systems in the continuum limit. In 2-dimcnsions, 
Pesin theory says that all the entropy of a diffeomorphism is carried by Smale 
horseshoes, which is to say, by periodic orbits with hyperbolic dynamics. Up to 
"zero entropy" therefore, one aims to recover the essential dynamics of a diffeomor- 
phism from an inverse system of combinatorial data, namely the braid type of the 
diffeomorphism modulo a finite invariant subset. 

Ghys [23] (building on work of Arnold, Sullivan, Tresser, Fathi, Gambaudo and 
others) has proposed to understand the group of area-preserving diffeomorphisms 
of a surface as a kind of "limit" of braid groups of finer and finer meshes of discrete 
points in the surface, and has explained how to use quasimorphisms on such braid 
groups to construct similar quasimorphisms on the transformation groups. This 
gives highly nontrivial information about the algebraic structure of such groups, 
but it is difficult in practice to interpret this information dynamically. 

One can approach this question from the other direction, by taking families of 
random walks in the braid group with respect to finer and finer meshes of discrete 
particles, and trying to obtain a law on random measure-preserving transformations 
in the limit. In order to do this, one needs to obtain a uniform modulus of continuity 
on the motion of the discrete particles on every fixed scale, to ensure that the 
limiting transformations are continuous. One way to obtain such estimates would 
be to show that a random element of a braid or mapping class group has a short 
factorization as a product of elements with support in subsurfaces with simple 
topology — i.e. to given an upper bound on its reducible length. In fact, our 
methods give lower bounds on reducible length of the same order of magnitude as 
(stable) commutator length. It therefore becomes a provocative question whether 
there are complementary upper bounds of the same order of magnitude. 

1.6. Summary of results. We now summarize the remainder of the paper. § 2 
contains a brief review of the theory of stable commutator length and quasimor- 
phisms that we use in the sequel. This material is standard. 

In § 3 we discuss the ergodic theory of geodesic combings on hyperbolic groups, 
and establish the main technical results necessary to obtain estimates on scl of 
random geodesies. Roughly speaking, a combing for a hyperbolic group determines 
a directed graph with edges labeled by generators, so that directed paths of length n 
in the graph correspond bijectively to elements of the group of word length n (with 
respect to some fixed generating set). The directed graph can be thought of as a 
topological Markov chain; assigning transition probabilities to edges determines a 
stationary Markov process (in the usual sense) . There is an assignment of transition 
probabilities of maximal entropy corresponding to the Patterson-Sullivan measure 
on the group; this allows us to define what is meant by a "random element of G of 
length n" in such a way that we can produce such elements by a Markov process. 

The first main technical result in this section is Proposition 3.12. In words, this 
Proposition gives a Chernoff-type estimate for the number of times any given path 
a of length I log nj log A occurs in a random path of length n in an ergodic Markov 
process, where I < 1 is fixed, and A is the entropy of the Markov process. The 
key subtlety is that the length of the subpaths we are counting is not fixed, but 
depends on the length of the big random path. The second main technical result in 
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this section is the content of Lemmas 3.15 and 3.16 which says that the subwords 
of fixed length appearing in different ergodic component of the Patterson-Sullivan 
Markov process are "coarsely equivalent" (as isometry classes of geodesic segments 
in the group) in distribution. 

In § 4 we apply the estimates obtained in § 3 to prove concentration for scl on 
random geodesies in hyperbolic groups at the scale n/logn: 

Hyperbolic Geodesic Theorem 4.10. Let G be a hyperbolic group, and S a 
finite generating set. There are constants C\, C2 > 0, C3 > ; C4 > 1, C5 > 
so that if g is a random element with \g\s € [n — C\,n + C\] conditioned to lie in 
[G,G], then 

P(C 2 n/logn < scl( 5 ) < C 3 n/logn) = 1 - 0(C7" C6 ) 

In fact, we obtain the stronger result c\{g) < C^n/logn, with the same estimate in 
probability. 

The proofs of the upper and lower bounds are somewhat different. The proof 
of the upper bound is related directly to the definition of (stable) commutator 
length, and depends on writing a random g efficiently as a product of commu- 
tators by directly pairing almost inverse subwords of g of length O(logn), using 
the Chcrnoff-type estimate in Proposition 3.12. The proof of the lower bound is 
obtained via Bavard duality, by constructing a homogeneous quasimorphism with 
controlled defect whose value on g is large. The homogeneous quasimorphism is 
a variant on the small counting quasimorphisms considered by Epstein-Fujiwara 
[18], for which, in contrast with the "big" counting quasimorphisms considered by 
Brooks [8], controlling the defect is easy. On the other hand, while obtaining the 
correct order of magnitude estimate of the defect is straightforward, obtaining ex- 
act estimates is substantially harder; this explains the gap between the bounds 
Cinj logn, C^nj logn in the Hyperbolic Geodesic Theorem, which we believe to be 
an artifact of the method of proof, rather than a genuine statistical phenomenon. 
In fact, the authors of [12] conjectured that one should be able to take Ci and C3 
arbitrarily close to log A/6, where A is the growth entropy of G with respect to S 
(i.e. the number such that the ball of radius n in G in the word metric with respect 
to 5* has 6(A") elements). 

In § 5 we obtain universal estimates on scl on random walks in finitely generated 
groups, and obtain precise order of magnitude estimates for hyperbolic groups and 
certain groups acting on hyperbolic spaces. For simplicity we consider random 
walks obtained by repeatedly convolving a symmetric measure of finite support, 
although our methods could presumably be extended to a more general context. 
Because of the symmetry of the Cayley graph of a free group with respect to a 
free generating set, our sharp estimates for scl on random geodesies in free groups 
gives sharp estimates for scl on random walks in free groups; together with the 
monotonicity of scl under homomorphisms, this gives the following universal upper 
bound: 

Universal Upper Bound Theorem 5.5. LetG be a group with a finite symmetric 
generating set S, and let \S\ — 2k. Let g be obtained by random walk on G (with 
respect to S) of length n (even), conditioned to lie in [G,G]. Then for any e > 
there are constants C\ > 1, Ci > so that with probability 1 — 0(C 1 n ) there is 
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an inequality 

scl(.g) < ((fc-l)log(2fc-l) + e)/6fc-n/logn 

The remainder of the section is devoted to finding lower bounds of the same order 
of magnitude for groups acting in a suitable way on hyperbolic spaces. We consider 
a group G acting simplicially on a 8- hyperbolic simplicial complex Y, and let fi 
be a symmetric probability measure with support equal to a finite subset S of G 
(note that we do not assume that S generates G). We say that the triple (G, fi, Y) 
is nondegenerate if it satisfies a short list of axioms, spelled out in Definition 5.7. 
These axioms are satisfied in the following important cases: 

(1) G is a hyperbolic group, Y is the Cayley graph of G, and (S) is a nonele- 
mentary subgroup; 

(2) G is hyperbolic relative to a family of subgroups, Y is the Groves-Manning 
hyperbolic complex certifying relative hyperbolicity (see [27], § 3), and (S) 
is nonelementary; 

(3) G is the mapping class group of a surface, Y is the complex of curves, and 
(S) is not reducible or virtually abelian. 

A discussion of more general examples (including the action of Out(-F„) on certain 
hyperbolic complexes) is deferred to § 5.10. 

In these terms, the main theorem proved in this section is the Hyperbolic Lower 
Bound Theorem: 

Hyperbolic Lower Bound Theorem 5.26. Let Y be a 6 -hyperbolic simplicial 
complex (not assumed to be locally finite), and let G be a finitely generated group 
that acts simplicially on Y. Let fi be a symmetric probability measure of finite 
support on G so that (G, /Li, Y) is nondegenerate, in the sense of Definition 5. 7. 

Let g be obtained by random walk on G (with respect to fi) of length n. Then for 
any C\ there is a Ci > and G3 depending only on S and G, so that with probability 
at least 1 — n~ Cl there is a homogeneous quasimorphism <p on G satisfying the 
following properties: 

(1) 4>(g) >nC 2 / log n; 

(2) D{4>) < G 3 ; 

(3) \<p{h)\ < 2dy{q, hq)Cs/logn for any g e G and any q^Y . 

In particular, for any C\ > there is a constant C > 1 so that if we condition on 
g G [G, G] (for n even), then 

P(C _1 n/logn < sc\(g) < Gn/logn) > 1 - n" Cl 

Two significant corollaries follow immediately: 

Reducible Bound Corollary 5.28. Let n be a symmetric probability measure 
on the mapping class group of E of finite support, and suppose the subgroup it 
generates is not reducible or virtually abelian. Then for any C\ there is C so that 
if g is obtained by random walk in G of length n, then g cannot be expressed as a 
product of fewer than Cnj log n reducible elements, with probability at least l — n~ Cl . 

Growth Obstruction Corollary 5.29. Let G be a group and \i a symmetric 
probability measure with finite support which generates G. Suppose that for any 
e > there is a S > so that if g is obtained by random walk on G of length n 
(even) conditioned to lie in [G,G], we have 

P(scl(g) < e • n/logn) > S 
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Then every homomorphism from G to a hyperbolic group or to a mapping class 
group has virtually abelian image. 

The Growth Obstruction Corollary is the promised quantitative strengthening 
of [4] alluded to in § 1.4. 

In § 5.8 we spell out the analogs of these theorems for parametric families of 
elements, in terms of the geometry of the unit ball in the scl norm of a random 
subspacc of B^(G) for G as above. In words: we show that the unit ball in a ran- 
dom subspace has uniformly bounded geometry (up to rescaling by a deterministic 
factor), and conjecture that it is C° close to a cross-polytope (i.e. to a unit ball in 
an L 1 norm). This conjecture is only known to be true for free groups, by [12]. 

In § 5.10 we consider random walks on non-locally compact Gromov hyperbolic 
spaces in some generality, and prove (under mild conditions) that all the axioms 
from Definition 5.7 are satisfied except possibly the condition of acylindricity. A 
key intermediate step is to show that random walks converge to the (Gromov) 
boundary almost surely. Although this does not have immediate implications for 
scl, it enables us to obtain linear lower bounds on translation length of elements 
obtained by random walk. Precisely, we show: 

Linear Translation Length Theorem 5.34. Let G be a group of isometries of 
a Gromov hyperbolic space Y , which is not necessarily locally compact. Let /i be a 
probability distribution with finite support on G, such that the support of /i generates 
a non- elementary subgroup of G. Then there are constants L > and c < 1 such 
that 

P(r(w n ) < Ln) < 0(c"), 
where w n is the group element obtained by a random walk of length n, and r(w n ) 
is the translation length of w n acting on Y . 

This theorem applies (for example) to the action of Out(F n ) on the free splitting 
complex, and on the complex of free factors, both of which are (Gromov) hyperbolic 

Finally in § 6 we survey what is known for arbitrary groups, and deduce uni- 
versal lower bounds on scl (known to be sharp in certain cases) from the work of 
Bjorklund-Hartnick [5]. Burger-Monod [9] showed that any quasimorphism differs 
by a bounded amount from a harmonic function. In particular, the expectation 
of this (adjusted) quasimorphism on a random walk is constant; i.e. the value of 
this function is a martingale. By applying the martingale central limit theorem, 
Bjorklund-Hartnick were able to obtain a central limit theorem for the distribution 
of values of a quasimorphism under a random walk. Their result applies in great 
generality, but if one specializes to finitely generated groups G with Q(G) finite di- 
mensional and Hi (G) torsion for simplicity, Bavard duality plus the main theorem 
of [5] implies that for any e there are positive constants a, b (depending only on e) 
so that P(a < sc\ n /y/n < b) > 1 — e, where scl„ denotes the value of scl under a 
random walk of length n. Colloquially one could say that scl has growth rate of 
order ^fn in this case. We obtained a special case of the theorem of Bjorklund- 
Hartnick before their work was announced; because the method of proof is more 
geometric, we decided it was worth including in § 6.1. One nice geometric corollary 
we obtain is as follows: 

Area and Winding Theorem 6.5. Fix some angle a and length I. Let P n be a 
random polygon in the hyperbolic plane with (cyclic) vertices po,pi, - ■ ■ ,p n , where 
d(Pi,Pi+i) = £ for each < i < n — 1 and an angle of ±a at each pi with < i < n, 
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with signs independently and uniformly chosen from ±1. Let A n be the algebraic 
area enclosed by P n , and W n the winding number of dP n . Then A n and W n both 
satisfy a central limit theorem with mean 0. 

2. Stable commutator length 

We recall some standard definitions and facts for the convenience of the reader. 
A basic reference for the material in this section is [10]. 

2.1. Stable commutator length. 

Definition 2.1. Let G be a group, and G' its commutator subgroup. Given g e G', 
the commutator length of g, denoted cl(g), is the least number of commutators in G 
whose product is g, and the stable commutator length, denoted scl(g), is the limit 

sc%) = hm ^1 

n^oo Ti 

The following estimates are elementary: 

Lemma 2.2. Let G be a group, andg, h elements of G' . Then there is an inequality 

sc\(gh) < scl(g) + scl(fc) + 1/2 

Lemma 2.3. Let G be a group with finite symmetric generating set S. Let g € G' . 
Then there is a constant C\ depending on G and S so that 

scl(.g) < d\g\s 

where \ ■ \s denotes word length with respect to S. 

2.2. Quasimorphisms. There is a duality between stable commutator length and 
certain functions on G called homogeneous quasimorphisms. 

Definition 2.4. Let G be a group. A function <f> : G — > R is a quasimorphism 
if there is some least non-negative number D{<j>) called the defect, so that for all 
g,h e G, there is an inequality 

\4>(gh)- 4>{g) - <j>{h)\ <D(<p) 

A quasimorphism is homogeneous if, further, it satisfies 4>(g n ) = n<fi(g) for all g e G 
and all n e Z. 

Denote the vector space of all quasimorphisms on G by Q(G), and the subspace 
of homogeneous quasimorphisms by Q(G). 

Lemma 2.5 ([10], Lem. 2.21 and 2.58.). Given any ip £ Q, the homogenization tp, 
defined by 

Tj)(g) = lim ip(g n )/n 

n— >oo 

exists and satisfies \ip — -01 < D{^P)- Moreover, ip is a homogeneous quasimorphism 
with D(ip) < 2D(i/>). 

Bavard duality is the following theorem: 

Theorem 2.6 (Bavard [2]; [10] Thm. 2.70). If G is a group andg e G' then 

sc%)= sup 

We use Bavard duality to obtain lower bounds on stable commutator length. 
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3. Hyperbolic groups 

A standard introduction to the theory of hyperbolic groups is [26]. More spe- 
cialized references for the material in this section are [13], [15] and [11]. 

3.1. Hyperbolic groups. Let G be a group with a generating set S. Let C${G) 
(or just C for brevity) denote the Cayley graph of G with respect to the generating 
set S. The Cayley graph Cs(G) can be made into a geodesic metric space, by giving 
each edge length 1. With this metric, d(id, g) — \g\s- 

Definition 3.1. A group G is hyperbolic if Cs(G) is hyperbolic as a metric space. 
That is, if there is some 5 so that if pqr is a geodesic triangle, any point on the 
geodesic pq is within distance 6 of qr U rp. 

We are casual about identifying G with the vertices of Cg{G), and in this way 
think of G as a metric space. If S is given, we say G is 8-hyperbolic for any 8 as 
above. 

There is an equivalence relation on proper geodesic rays in Cs (G) , where two rays 
are equivalent iff they are a finite Hausdorff distance apart. The set of equivalence 
classes is itself a compact space called the Gromov boundary of G, and denoted 
dooG. The left action of G on itself (or on Cs(Gj) extends to an action on dooG 
by homeomorphisms. 

3.2. Combings. For an introduction to combings, regular languages etc. see [17]. 
Let S* denote the set of finite words in the generating set S, let | • | denote word 

length in S* , and let cval : S* — > G denote the evaluation map. A word w <G S* is a 
geodesic if |eval(u>)|s = \w\. Under evaluation of prefixes, a word w £ S* determines 
a directed path in the Cayley graph Cs(G) from id to eval(w). We denote this path 
by path(iu). 

Suppose A is a finite directed graph (hereafter digraph) with a distinguished 
initial vertex, and with edges labeled by elements of S in such a way that there is 
at most one outgoing edge from each vertex with a given label. 

Let T denote the set of finite directed simplicial paths in X, and r the subset 
starting at the initial vertex. There is an injective map word : r — > S* which 
takes a path 7 to the string of edge labels on the edges it traverses, in order. The 
composition path o word takes paths in To to paths in Cs{G) starting at id. When 
the meaning is clear from context, we denote patho word(7) simply by 7. 

A subset of the form word(Fo) C S* is necessarily prefix-closed, since Tq is closed 
under taking initial subpaths. Furthermore, word(Fo) is a regular language; in fact, 
a subset L C S* is a prefix-closed regular language if and only if there is some X 
with word(r ) = L. 

Definition 3.2. A combing of G (with respect to a generating set S) is a subset 
L C S* for which there is some labeled digraph X as above with word(r ) = L 
such that 

(1) the evaluation map eval : L — > G is a bijection; and 

(2) the words of L are geodesies. 

We say the digraph X parameterizes the combing. 

Note that L is prefix-closed by our conventions. Note also that for a combing, 
the image of 7 e r under path o word is a geodesic in Cs(G). 
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Remark 3.3. Many conflicting definitions of combings appear in the literature. Our 
definition is by no means standard. 

Theorem 3.4 (Cannon [13]). Let G be a hyperbolic group, and let S be a generating 
set. Then there is a combing for G with respect to S . 

Example 3.5. An ordering of S determines a lexicographic order (i.e. dictionary 
order) on S* . In any hyperbolic group G, the language L of lexicographically first 
geodesic representatives is an example of a combing in the sense of Definition 3.2. 

3.3. Markov chains. The digraph A is a topological Markov chain. Associated 
to A is the transition matrix M whose entry counts the number of directed 
edges from vertex i to vertex j. Note that M has a non- negative real eigenvalue A 
do that |£| < A for every other eigenvalue £ of M. 

Two vertices in a digraph are said to be communicating if there are directed 
paths between the vertices in either direction. This defines an equivalence relation 
on A, and we define a component to be a maximal subgraph whose vertices are a 
communicating class. There is a natural digraph C(X) obtained as a quotient of 
A, whose vertices are the communicating classes of A. From the definition it is 
clear that the digraph C{X) has no directed loops. 

If C is a component of A, the adjacency matrix Mq of C has biggest real 
eigenvalue Ac < A. 

Definition 3.6. A component C is maximal if Ac = A. 

In general, there are no constraints on C(X) other than the fact that it has no 
directed loops. However, for G a hyperbolic group, and A a digraph parameterizing 
a combing, we have the following, due (implicitly) to Coornaert: 

Theorem 3.7 (Coornaert [15]; [11] Lem. 4.15). Let G be a hyperbolic group, and 
let X be a digraph parameterizing a combing. Then each directed path in C(X) 
contains at most one maximal component. Equivalently, if A is the maximal real 
eigenvalue of M, there are positive constants C\, C*2 so that 

Ci\ n < \G n \ < C 2 \ n 

There is a natural stationary Markov chain with states the vertices of A, for 
which the whose positive transition probabilities are a subset of the (directed) 
edges of A. This Markov chain is described in detail in [11], § 4. We let Nij denote 
the probability of a transition from state i to state j (note = if My = 0) and 
\i a certain stationary measure for N of maximal entropy. 

Lemma 3.8 ([11] Lem. 4.9-10). N is a stochastic matrix, and \x is stationary for 
N. The support of fi is the union of the maximal components of X. 

For each n, let T n denote the subset of T consisting of paths of length n, and 
let Tq denote the subset starting at the initial vertex. Note that Fq bijects with 
G n under eval o word. There is a probability measure on T n which we denote \i by 
abuse of notation, defined by 

Ml) = M(7(0))A 7(0)7(1) A 7(1)7(2) • • • A 7( „_ 1)7(n) 

Thinking of T n as the cylinder sets in the space T°° of infinite paths, there is an 
associated probability measure on T°° which is invariant under the shift map s 
which takes a path to the suffix obtained by omitting the first vertex. 
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We can also define a measure v on Tq by 

v {l) = ^0,7(1)^7(1)7(2) ' ' ' ^V 7 (n-l)7(n) 

(since 7(0) = by definition for 7 G r o l ), and extend to a measure on Fg . 
The shift map s takes Tg° into r°°, and we have /1 = lim„_ ) . 00 ^ X^"^ 1 s *^- Note 
that ^ is just jU conditioned on T C L (or on Tq C T™ for each n). Note too 
that if the maximal components of X (the support of fj) are all aperiodic, then 
fi = limn^oo s™v, and in general fi = lim„_ ) . 00 ^ Y^T^ s l + ™^ where m is the 1cm 
of the periods of the maximal components. 

3.4. Ergodicity at infinity. For each n, let v n be the probability measure on G 
defined by 

„ _ S| fl | fl <nA-'"'* fl 

E, S | S <„A-H S 

where £ 5 is the Dirac measure supported at g. This extends trivially to a probability 
measure on G U dooG. It turns out that the limit v — Vmi-n—^oo v n exists and is 
supported on dooG, where it is known as a Patterson-Sullivan measure. 

Theorem 3.9 (Coornaert [15], Thm. 7.7). Let v be the Patterson-Sullivan measure. 
The action of G on d^G preserves the measure class of v, and is ergodic. 

The relationship between the measures u n and the measures v on Tq is as follows. 
Let 7 € Tg and let g = eval(word(7)). Define cone(g) to be the image in G of the 
union of the 7' e T™ extending 7. Then 1/(7) = ^(cone(g)) := lim^oo j/„(cone(g)). 
Let 7' € Lq +1 be obtained by extending 7, and let h = eval(word(7')) so that h = gs 
for some generator s. Let i be the terminal vertex of 7, and j the terminal vertex 
of 7'. Then % = u(cone(h))/u(cone(g)). See [11] § 4.3. for details. 

The bijection Fg — > G n lets us compare the ^-measure on Tg with the uniform 
measure on G n . From the relationship between v and Nij , the ^-measure of g G G n 
(i.e. v(cone(g))) depends only on cone type of g, or equivalently on the terminal 
vertex of the corresponding path in Tg . Since there are only finitely many cone 
types, and since every g G G is a uniformly bounded distance from some g' whose 
cone has growth 6(A") (and therefore has v(g') bounded away from 0) we derive 
the following proposition: 

Proposition 3.10 ([11], § 4). There are positive constants Ci, 62,63 with the 
following property. Let g G G n , and let B denote the intersection of the ball of 
radius C\ about g ( in Cs (G) ) with G n . Let Br denote the preimage of B under the 
natural bijection Lq — > G n . Then there is an inequality 

C 2 v{B T )/v{T n ) < \B\/\G n \ < Czv{B T )/v{T%) 

Finally, we state a proposition which describes the typical behavior of a ^-random 
element of Tq . 

Proposition 3.11 ([11], Lcm. 4.10). Let 7 be a v-random element o/Tq. Then 
as n — > 00, with probability 1 — 0(C ] ~ C ' 2 ™), apart from a prefix of size C3 logn, the 
path 7 is entirely contained in a single maximal component C, and it enters (and 
stays in) a given maximal component C with probability /i(C). 

The collective significance of these propositions can be summarized in the fol- 
lowing way: every g G G n is a uniformly bounded distance away from some h G G n 
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corresponding to a path 7 € Tq which is generated (apart from a prefix of length 
O(logra)) by one of finitely many ergodic stationary Markov processes, correspond- 
ing to one of the maximal components of X. 

We will establish properties which hold for typical sequences in each of these sta- 
tionary Markov processes. Sequences in different Markov processes can be compared 
using Theorem 3.9. This will let us establish properties which hold for ^-typical 
.96 G n , and we will deduce that the properties also hold for typical g e G n (with 
respect to the uniform measure). 

3.5. Mixing times and a Chernoff-type estimate. Let X be a finite stationary 
ergodic Markov chain with transition matrix N, and /1 the stationary measure. 
Define measures fi on T n and r°° as before. In our applications, X will be a 
maximal component C(X) of X as above, and fj, will be \x conditioned on C(X). 

If a, 7 € r, define C a (^) to be the number of times a appears as a subsequence 
of 7. Now if 7„ is a /i-random element of T n , then ergodicity of X implies that 
^C a (-f n ) — > /lx(ct) in probability. A Chernoff-type estimate says that the probability 
that the deviation |C <7 (7„)— n/j,(o~)\ is of size <d(n s ) decays exponentially in n, for S in 
a suitable range (the critical exponent is half the exponent of n/i(a), corresponding 
to the distribution in the central limit theorem). For our purposes it is necessary 
to estimate the probability when \a\ — O(logn). 

Let A be the entropy of X (as above), and let m = log A n = logn/logA. Note 
that there are 0(n e ) elements of T of length lm. 

Proposition 3.12. Suppose a has length lm with I < 1. Then for any e > there 
are constants C\ > 1 and C2 > ( depending on X and I but not on n or a) so that 

P (|a(7«) - nn{v)\ < n £+(1 - f)/2 ) = 1 - 0{C^° 2 ) 

In fact, there is a fixed integer L so that for each residue i mod Lm, if 7^ is the 
subpath of 7 of length lm starting at i + kLM, and C a ^ n ) is the number of copies 
of a amongst the j^k, then 

P (\CM - < n^-W) = 1 - 0(Cr C2 ) 

Proof. This proposition is essentially a special case of a theorem of Dinwoodie [16], 
and we sketch the proof, referring to [16] for details (also compare [12] § 2). For 
simplicity, we assume X is aperiodic (the general case is very similar). 

In fact, the first estimate follows from the second, so we prove the second. Choose 
some big fixed integer L, and let i be a residue mod Lm. Let 7^ be the subpath 
of 7 of length im starting at j + kLm. For fixed i, the successive 7^ are not 
independent, but their correlation is extremely small. Explicitly, if Ai < 1 is the 
second largest eigenvalue of the transition matrix TV, then conditional on any given 
value of ji.k, the probability that the first vertex of 7i,fe+i is equal to some j differs 
from fi(j) by \[ L ^ m . So conditioned on any given value of 7^, the probability 
that 7i,fc+i is equal to a is bounded above by /i(er) (1 + \[ L ^ m / /x(cr(0))) and below 
by (u((t)(1 — \[ L e ^ m j fi(a(0))) where cr(0) is the initial vertex of a. If L is sufficiently 
large, these bounds are approximately m(ct)(1 ± n"° 3 ) where C3 may be taken to 
be as large as we like. 

Let C CTi i(7„) be the number of copies of a amongst the 7^. By the argu- 
ment above, C a ^ n ) may be estimated from the Chernoff bound for independent 
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Bernoulli variables; see e.g. [45], Thm. 1.3.13. Explicitly, we obtain an estimate of 
the form 

P (\CM ± n- C >)\ < n^ 1 "^ 2 ) = 1 - 0(Cr C2 ) 

Taking C 3 sufficiently large, we may absorb it into the n e+ ( 1- ^/ 2 term, at the cost 
of increasing e an arbitrarily small amount. 
Summing over i we get 

P (|C(7n) - M<7)l < Lm • n^ 1 "^ 2 ) = 1 - 0(Lm • Cf"^ ) 

Since Lm — O(logn), by adjusting constants the result holds. □ 

3.6. Anti-alignment. 

Definition 3.13. Two oriented geodesic segments 71,72 in Cs{G) are K anti- 
aligned if there is g £ G so that the terminal point of 5(71) is within distance K of 
the initial point of 72, and vice versa. 

By the defining property of <5-hyperbolicity, the translate of 71 is contained in 
the K + 28- neighborhood of the translate of 72, and vice versa. Note the property 
of being K anti- aligned is a property of orbit classes of geodesic segments in Cs(G) 
under the action of G. Given any path 7 G T, the orbit class of 7 in Cs{G) is 
well-defined, so it makes sense to ask if a pair of elements of T are K anti-aligned. 

Proposition 3.14. Fix L > 2 and some constant K. Let 7 be a v-random element 
o/Tq ; and let 7$ denote the successive subpaths 0/7 of length Lm. Then there is 
some e > and constant C\, C2 so that 

P : 7i K anti-aligns some 7' C 7} < n 1_e ) = 1 - 0(Cf"° 2 ) 

Proof. By Proposition 3.11, we can assume that apart from a prefix of size O(logn), 
7 is contained in some maximal component C. 

Now, if 7i,7' subpaths of length Lm are K anti-aligned, then there are disjoint 
7j' C 7, and 7" C 7' of length at least Lm/2 which are K + 2S anti-aligned. Replace 
K by K + 25 and L by L/2, and by abuse of notation let 7, denote successive 
subpaths of length Lm. Then we need only show that 

P : 7» if anti-aligns some 7' disjoint from ji} < n 1 ^ 6 ) = 1 — 0(Cf™ 2 ) 

For any given location of 7' in 7, there are 0(n L ) distinct elements of Tq which 
agree with 7 outside 7', all with approximately the same /i-measure. If 7' anti- 
aligns some 7i, then 7' is determined by 7,, by h\,h,2 € G of length < if, and by 
the initial vertex of 7' in X. Hence for each 7, there are only C 3 possible choices 
for 7', and therefore only O(n) choices where we range over all i. In particular, 
conditioning on 7 outside 7', the probability that a given 7' if anti-aligns some 7$ 
that it is disjoint from is at most 0(n 1_L ). The proposition follows. □ 

3.7. Comparing components. Proposition 3.12 lets us very accurately estimate 
the number of copies of a subword a in a random word j n of length n in a maximal 
component C of X, at least when \a\ = lm with I < 1 for m = log A n. 

However, it is necessary to compare such distributions for different maximal 
components. We introduce some notation. 

For each maximal component C, let fi\c be the conditional probability measure 
with support in C. That is, define /u(-)|c = M") / 'M^O m Cj an d m(')Ic =0 on the 
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complement of G. By abuse of notation, this defines a probability measure n\c on 
T, defined on cylinder sets by 

lAcil) = M|c(7(0))-/V 7 (o) 7 (i) ■ • • N l{ 

Proposition 3.12 says that for j n a ^-random path in X conditioned to enter a 
specific component G, we have |C<x(7 n ) — /i|c(c)| < n e+ ^^ e ^ 2 with exponentially 
few exceptions. Under the evaluation map, for each M the measure /x|c on T M 
pushes forward to a probability measure on Gm- We would like to compare 
the measures /i\c on Gm for different maximal components C. 

It is too much to hope that these measures will be equal. However, it turns 
out that each n\ Ci can be obtained from fi\ Cj by (roughly speaking) a random 
convolution process, as described in the following lemma: 

Lemma 3.15. Suppose the Cayley graph Gs(G) is 5-hyperbolic. Then the following 
is true: for each pair of big components d, Cj of X , and for each M, there is a 
map fij : Gm — > Prob(GM) satisfying 

(1) for all g and all h in the support of fi,j(g) there is a € G with \a\ < 5 so 
that d(g,aha~ 1 ) < 26; and 

(2) »\c j (h)=E g »\c i (9)fiA9)(h). 

We express bullet (2) by saying that n\cj is obtained by convolving fi\c. with fcj. 

Proof. In fact, the proof of this lemma is a very simple trick, which is a variation on 
the main trick of [11]. By Kakutani's random ergodic theorem (see [34]; alternately, 
this follows from Proposition 3.12) and the definition of ^i, m , almost every infinite 
path in X that enters Cj is composed of subpaths of length M that are distributed 
in Gm according to /i|cv Call an infinite path (i.e. an element of T) Ci-typical if 
its subpaths have this property. Let ji be a C^-typical infinite path, and by abuse 
of notation, let -fi denote the corresponding infinite geodesic path in G starting at 
id, limiting to some point in c^G. By ergodicity of the action of G on c^G (i.e. 
Theorem 3.9), there are typical ji and -fj and some a e G so that a ■ ji and have 
the same endpoint, and therefore outside some compact subset, they synchronously 
fellow-travel. It follows that we can subdivide a-ji and jj into subpaths of length M, 
throwing away finitely many at the start, so that corresponding subpaths are each 
contained in the (^-neighborhood of each other. If two associated subpaths evaluate 
to g and h in Gm respectively, then g = a\ha2 where each of a\,a,2 has length at 
most 5. Hence d(g,a\ha^ 1 ) < 26. So define fij(g)(h) to be the probability that a 
subpath of -fi evaluating to g fellow-travels (as above) a subpath of jj evaluating 
to h. □ 

We conclude that each individual measure fi\d can be convolved (in the sense 
above) by a weighted average J^Pjfij to obtain a single probability measure [j! on 
Gm- Now, though it might not be true that pi! {v) — yi! (y~ v ) for all v e Gm, this is 
approximately true, up to convolution in the sense of Lemma 3.15: 

Lemma 3.16. Suppose the Cayley graph Gs(G) is 6-hyperbolic. Then the following 
is true: for each M there is a map f : Gm Pto^Gm) satisfying 

(1) for all g and all h is the support of f(g) there is a G G with \a\ < 5 so that 
d(g,ah~ 1 a~ 1 ) < 28; and 

(2) M'(M = E^'(.9)/(.9)(M- 
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< -'2 

Proof. Choose some N ^> M, so that with probability 1 — 0(C-f n ), a random 
path 7 in T N satisfies |C CT (7) - 7V^| Ci (ct)| < iV 1 -^ f or every a e r M , where Cj is 
the big component that 7 enters (the existence of such an TV given M follows from 
Proposition 3.12). 

We call a 7 for which such an estimate holds (for some i) almost typical. By 
Proposition 3.10, there are almost typical paths 7 and 7' evaluating to g and g' in 
Gn respectively, and satisfying d(g^ 1 ,g') < C4. 

Let 7 _1 denote the path in G from id to g~ l obtained by reversing and translating 
7 (note that 7 _1 will not typically be an element of V). Since subpaths of 7 _1 arc 
in bijection with subpaths of 7 but oriented oppositely, C a (j) = C cr -i(7 _1 ) for all 
a of length M. Since 7 -1 and 7' synchronously fellow-travel, after throwing away 
suffixes of uniformly bounded length (depending only on C4), we can pair subpaths 
of 7 _1 and 7' of length M so that corresponding subpaths are each contained in the 
(^-neighborhood of each other. Taking N — > 00 (for fixed M) the proof follows. □ 

4. scl OF RANDOM GEODESICS IN HYPERBOLIC GROUPS 

We now have nearly all the necessary tools to estimate scl on random elements in 
hyperbolic groups. There is one significant additional complication for hyperbolic 
groups G for which Hi(G) has positive rank, namely that scl is only defined on 
elements in the commutator subgroup. 

We will show in § 4.3 that the relative proportion of [G, G] in the set of elements 
of length [n — C\ , n + C\] (for a suitable constant C\) is of size 0{n~ k / 2 ) where k is 
the rank of H\{G); in particular, it is polynomial in n. It follows that properties of 
words that hold with probability 1 — 0(C^~™ 3 ) in G n will also hold with a similar 
estimate in probability for words conditioned to lie in [G, G]. 

4.1. Almost pairing subwords. 

Proposition 4.1. Let G be a hyperbolic group, and let S be a generating set. Let 

A be the growth rate for G (i.e. the number such that \G n \ = 0(A") ). Then for any 

e > 0, there are constants C\ > 1, C2 > so that if g is a random element of G n , 
_ C2 

with probability 1 — 0(C 1 n ) the following holds: 

(1) there is some h with \h\s < 8(5 • n(log A + e)/ logn + o(n/ logn); and 

(2) the product gh^ 1 is in [G, G\; and 

(3) there is an estimate c^gh^ 1 ) < n(logA + e)/21ogn. 

Proof. The proof follows by assembling the results of the previous section. Let 
7 G Tq correspond to g. By Proposition 3.10, it suffices to let 7 be a ^-random 
element of Tq. With the desired probability, we can assume apart from a prefix 
and suffix of size O(logn), that 7 is contained in a single maximal component C. 
Fix some t < 1, and consider the set of successive subpaths 7* of 7 of length £m, 
where m — log nj log A. By Proposition 3.12, the distribution of the 7$ is very close 
to fi\c- By Lemma 3.15 and Lemma 3.16 we can pair most of the 7$ in such a way 
that the evaluation of each pair is approximately inverse; i.e. if gj are the values 
of the pair of segments in G, there are a\, ai with |a,| < 4<5 and gt = aigj 1 a 2 . We 
can therefore cancel each gi with gj at the cost of a commutator plus a word of 
length at most 86. The product of these 83 words is h. All but o(n/logn) can be 
paired in this way; the remainder can be absorbed into h at the cost of adjusting e 
an arbitrarily small amount (for big n). The proof follows. □ 
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4.2. Counting quasimorphisms. We obtain lower bounds on scl via Bavard du- 
ality, by constructing explicit quasimorphisms with uniformly bounded defect, and 
value 0(n/ log n) on random g. The quasimorphisms in question are a variant on 
the small counting quasimorphisms of Epstein- Fujiwara [18]. In fact, it is useful to 
work in the generality of a group acting on a <5-hyperbolic complex, following [20] 
(also see [10], § 3.5). 

Let Y be a 8- hyperbolic simplicial complex {not necessarily locally finite) and 
let G act on Y simplicially. If a is a finite oriented simplicial path in Y, let a^ 1 
denote the same path with the opposite orientation. A copy of a is a translate g ■ a 
for some g £ G. 

Definition 4.2. Let Y be a 5-hyperbolic simplicial complex, and let p £ Y be a 
base point. Let E be a (possibly infinite) collection of oriented simplicial paths in 
Y, and let E _1 denote the collection obtained by reversing the orientations on all 
a £ E. For any oriented simplicial path 7 in Y define 

c s (7) = maximal number of disjoint copies of o £ E contained in 7 

and then for g £ G define 

ce(s) = d(p,g(p)) - inf(lcngth(7) - c E (7)) 

7 

where the infimum is taken over all oriented simplicial paths 7 from p to g(jp). 
Define the small counting quasimorphism by the formula 

Ms) : =ce(s)-c e -i(#) 

A path 7 from p to g(p) realizing the infimum of length^) — Ce(7) is called a 
realizing path for 7. Since the values of this function are integers, a realizing path 
exists. 

Lemma 4.3 (Fujiwara [20], Lem. 3.3). Suppose the length of every a £ E is > 2. 
TTierc any realizing path is a {2 1 A)-quasigeodesic. 

It follows that any realizing path is within distance C\ of a geodesic, where C\ 
depends only on 8. 

Lemma 4.4 (Fujiwara [20], Prop. 3.10). Suppose the length of every a £ E is > 2. 
Then there is a constant Ci depending only on 8 such that D(hs) < Ci. 

Remark 4.5. Actually, Epstein- Fujiwara only consider small counting functions for 
a single a, but the bounds on the geometry of realizing paths and on the defect of 
D(hz) are valid for arbitrary collections E as above. This is by contrast with the 
"big" counting functions introduced by Brooks [8] , in which one counts all (possibly 
overlapping) copies of a, not just a maximal disjoint collection. 

Remark 4.6. As defined, the quasimorphism ce depends on the choice of basepoint 
p. However, different choices of points p give rise to quasimorphisms with the same 
homogenization. Since it is the homogenization we really care about, we gloss over 
this detail. 

We apply this construction to the case that Y = Cs(G) and p = id so that 

d (p,g(p)) = \g\s- 

Proposition 4.7. Let G be a hyperbolic group, and let S be a generating set. 
Let X be the growth rate for G. Then for any e > there are constants C\ > 
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1,6*2 > and G3 depending only on 6 so that if g is a random element of G n , with 
probability 1 — 0(Gf ™ ), there is a homogeneous quasimorphism <fi on G satisfying 
the following properties: 

(1) 4>(g) > nlogA/(2 + e)logn; 

(2) D(<j>) < G 3 ; 

(3) \<t>(h)\ < 2|/i| s logA/(2-e)logn for any heG. 

Proof. By Lemma 3.10 it suffices to prove the theorem for a j/-random element g. 
Fix L = 2 + e, and recall the notation m = log n/ log A. Let 7 g Tq correspond 
to g, and let ji denote the successive subpaths of 7 of length Lm. For a suitable 
constant K (to be determined shortly), let £ be the subset of the 7$ which do not 
K anti-align any 7' C 7. If c s denotes the small counting function associated to 
the set £ (thought of as a collection of orbit classes of subpaths of the Cayley graph 
Cs{G)) then by Proposition 3.14 we obtain an estimate cs(g) > n log A/(2+e) logn. 

Now, there is a constant K so that any (2, 4)-quasigeodesic in a <5-hypcrbolic 
space stays within distance if of a genuine geodesic. If we choose K with this 
property, then by Lemma 4.3 and the definition of £ we have c s -i(<?) = 0, and 
therefore hj:(g) > nlogA/(2 + e)logn. On the other hand, D(<p) < G3 for some 
constant G3 depending only on 5, by Lemma 4.4. 

Finally, if h is any element of word length \h\s, any realizing path for or c s -i 
is a (2, 4)-quasigeodesic, which therefore has length at most 2\h\s + 4 and therefore 
contains at most (2|/i|s + 4)/Lm disjoint paths of length Lm. □ 

In particular, if g e [G,G] then scl(g) > Cin/logn with very high probability, 
and if there is h with \h\s = 0(n/logn) and gh^ 1 e [G,G] then scl(gft, _1 ) > 
Cinj logn. 

4.3. Homology. Proposition 4.1 and Proposition 4.7 together give upper and lower 
bounds on scl(gft. -1 ) for some h with \h\s = 0(n/ logn) and gh^ 1 e [G, G]. If g is in 
[G, G] then so is h, and scl(/i) can be estimated from its length (see Lemma 4.8). We 
will see in this section that the relative proportion of homologically trivial g in G n 
is polynomial of bounded degree, and therefore a random element of G n conditioned 
to lie in [G, G] will have two-sided bounds on scl, with high probability. 
The following lemma is elementary: 

Lemma 4.8. Let G be a group and S a finite generating set. There is a constant 
C\ so that if h is an element of [G,G] then cl(h) < Ci\h\s- 

Proof. For simplicity, we suppose Hi(G) is torsion-free; the general case is not 
significantly harder. 

Assume without loss of generality that £ = and let L be the free abelian 
semigroup generated by S as a set. We think of L as the intersection of the positive 
orthant in M} s \ with l) s \ . The map from S to Hi (G; R) extends by linearity to Rl 5 ' , 
and the kernel if is a rational subspace. It follows that K n L is finitely generated 
as an abelian semigroup (such a generating set is known as a Hilbert basis); see e.g. 
[1] . It follows that if we write ft as a (geodesic) word in the elements of S, there is 
a constant C\ so that there is a subset of letters of cardinality at most C\ whose 
image in homology is trivial. We move these letters to the right, at the cost of at 
most Gi commutators. In other words, we can write h = where each hi 

is in [G, G], where \h\\s < \h\s, where \h2\s < Ci, and where cl(hs) < C\. Since 
there are only finitely many elements of G with | • \s < C\, there is a bound on 
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the commutator length of the homologically trivial ones. Hence \h2\s < G2 and 
therefore by induction, cl(h) < (Ci + G2) • \h\s as required. □ 

The main proposition of this section estimates the relative density of [G, G] in 
G n . 

Proposition 4.9. Let G be a hyperbolic group with finite generating set S, and 
suppose the rank of Hi(G) is k. There are constants C\, Ci so that if g is a 
random element of [n — C\,n + C\] (with the uniform probability), then P(g £ 
[G,G]) > G 2 -n- k / 2 . 

Proof. The case that G is free and S is a free generating set is due to Sharp [44], 
with C\ = 1. However, the proof does not use very particular properties of free 
groups, and generalizes to hyperbolic groups. The key probabilistic component is 
a (standard) local limit theorem for random sums in ergodic (finite, stationary) 
Markov chains. 

For general hyperbolic groups, one cannot apply such local limit theorems di- 
rectly because X might have more than one maximal component; however, we can 
apply local limit theorems to each maximal component individually. To do this we 
need to know that the expected value in -Hi(G) of (the evaluation of) a random 
walk conditioned to lie in a maximal component Ci of X is zero. This follows from 
the main theorem of Calegari-Fujiwara [11], since a homomorphism to Z is an ex- 
ample of a bicombable function. It follows that the Markov chains associated to 
each component G, satisfy the conditions in [44], Thm. 2 and we can obtain sharp 
estimates of the desired form on the probability that the evaluation of a random 
path in Ci has trivial abelianization. 

Now, a ^-random 7 £ Tq is of the form 7172 where 72 is contained in some 
maximal component. It is not true that we can bound the length of 71, but it is 
true that there is a constant C\ so that P(|7i| < C\) > 1/2. If a denotes the 
abelianization map a : T n — > Hi(G), then [71 1 < C\ implies a uniform estimate 
H7i)|<<?2. 

Because of the Markov property, we get an estimate for g a ^-random word of 
length n of the form 

P(Hg)\<C 2 )>c 3 -n- k ^ 

By Proposition 3.10, we get a similar estimate (but with different constants) for g a 
random element of G n with the uniform probability. Now, there is a constant G4 so 
that for every g with \a(g)\ < Ci there is h with \h\s < C1C4 so that gh £ [G, G}. 
The map g — > gh is bounded-to-one, so the cardinality of the intersection of [G, G] 
with the set of words of length in the interval [n — CiC^,n + G2G4] is at least 
C b n- k / 2 ■ |G„|. The proof follows. □ 

Putting this all together, we obtain the main theorem in this section. 

Theorem 4.10 (Hyperbolic geodesic theorem). Let G be a hyperbolic group, and 
S a finite generating set. There are constants C\, Ci > 0, G3 > 0, G4 > 1, G5 > 
so that if g is a random element with \g\s £ [n — C\,n + C\] conditioned to lie in 
[G, G], then 

P(G 2 n/logn < scl(«?) < G 3 n/logn) = 1 - 0(C^ nCt> ) 

In fact, we obtain the stronger result c\(g) < C^n/ \ogn, with the same estimate in 
probability. 
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Proof. The estimates in Proposition 4.7 and Proposition 4.1 hold with probability 
1 — 0(C~ n ), and therefore they still hold with the same order of probability (with 
different constants) conditioned on g € [G, G], by Proposition 4.9. Bavard duality 
(Theorem 2.6) therefore gives the lower bound. 

Similarly, for random g conditioned to lie in [G, G], Proposition 4.1 says we 
can write g = gh~ 1 h where cl(g/i -1 ) < Gn/logn, and where \h\s < Gn/logn. 
Since g <£ [G, G] and gh^ 1 e [G, G], we have h G [G, G]. So Lemma 4.8 gives 
cl(h) < Cn/\ogn. Putting this together gives the upper bound on cl(g), with the 
desired estimate in probability. □ 

As remarked in the introduction, the gap between the upper and lower bounds 
is presumably an artefact of the method of proof; in fact in [12] the authors con- 
jectured that there should be concentration for the random variable scl(g) logn/n 
at log A/6, where A is the growth entropy of G with respect to the generating set 
S. The main theorem of [12] proves this for a free group with respect to a free 
generating set (see Theorem 5.1 below for a precise statement). 

5. scl OF RANDOM WALKS IN GROUPS 

In this section we obtain estimates on the value of scl(^) where g is obtained 
by a random walk in a group G, providing G satisfies certain hypotheses. Even 
in a hyperbolic group G with a fixed generating set S, the probability distribu- 
tions defined by random geodesies and by random walks are not usually uniformly 
comparable, and typically become mutually singular at infinity. However, there is 
one very important special case in which the two probability distributions can be 
compared very precisely, namely the case of free groups with a free generating set. 
It follows that we obtain upper bounds on scl in random words in free groups, with 
high probability. 

The significance of this is not that we are interested in free groups per se, but 
rather that scl is monotone under homomorphisms. If G is any group, and S any 
(symmetric) generating set, then there is a surjective homomorphism Fs — > G 
where Fs is the free group on S, and (simple) random walk on Fs pushes forward 
to random walk on G. It follows that any upper bound on scl on random walks in 
free groups gives a universal upper bound on scl on random walks in any group G. 

These upper bounds are complemented by lower bounds for hyperbolic groups, 
and for certain groups acting on hyperbolic spaces (e.g. braid groups, mapping 
class groups). Universal lower bounds, valid for all groups, are obtained by a quite 
different method, and discussed in the next section. 

5.1. Special case: free groups. For F a free group of rank k, Calegari- Walker 
obtained a sharpening of Theorem 4.10: 

Theorem 5.1 (Calegari- Walker [12], Thm. 4.1). Let F be a free group of rank k, 
and let g be a random element of length n in a free generating set where n is even, 
and g is conditioned to lie in [F, F] . Then for every e > and every C there is an 
estimate 

P(|scl(#) log n/n - log(2fc - 1)/6| < e) = 1 - 0(n" c ) 

In fact, the upper bound (i.e. scl(gr) log n/n — log(2A; — l)/6) < e) is proved 
to hold with probability 1 - 0(G~" C ) for some C > l,c > 0; see [12], Prop. 4.2. 
Theorem 5.1 is derived from a proposition, valid for g random of length n, and 
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then conditioning on g G [F, F]. The following proposition is implicit in [12]; for 
completeness, we indicate how it follows immediately from § 4.1-3 of that paper. 

Proposition 5.2. Let F be a free group of rank k. Then for any e > 0, there are 

constants C\ > 1, C2 > and S > so that if g is a random element of F n , with 

_ c 2 

probability 1 — 0(C 1 " ) the following holds: 

(1) there is some h with \h\s < 0(n 1 / 2+£ ); and 

(2) the product gh^ 1 is in [F,F]; and 

(3) there is an estimate scl(g/i _1 ) < n(log(2fc — 1) + e)/61ogn. 

Proof. It is convenient to use the (well-known) extended definition of scl as a norm 
on homologically trivial formal real group 1-boundaries; see [10], § 2.6 for details. 
Lcm. 4.7 [12] says that there is some formal 1-chain V with |r| s = C^n 1 -" 5 ) for 
some 5 > so that sc\(g — T) < n(log(2fc — 1) + e)/61ogn. By an estimate of Rivin 
(see e.g. [43]) we can assume that the L 1 norm of [g] e H\{F) = Z fe is 0(n 1 / 2+£ ) 
for any e, with probability 1 — (9(Cf™ 2 ), and therefore there is h with [g] = [h] 
and \h\ s < 0{n}l 2+€ ). We estimate 

scl^/i" 1 ) <sc\{g + h- l ) + l/2 

< sci(g - r) + sci(r + h- 1 ) + 1/2 

< n(log(2fc - 1) + e)/61ogn + Ofa 1 " 5 ) 

and the 0(n 1_<5 ) may be absorbed into the e. □ 

Remark 5.3. The reader who is uncomfortable with the proof of Proposition 5.2 
can safely use Proposition 4.1 instead in the sequel, after observing that 6 = 
in a free group with a free generating set. The only cost is that the constant in 
Proposition 4.1 is worse by a factor of 6, whereas the constant in Proposition 5.2 
is sharp. 

5.2. Universal upper bounds. We now compare random words with random 
walks. The Cayley graph of F with respect to a free generating set is a regular 
2/c-valent tree. The group of simplicial automorphisms of this tree, fixing the origin, 
acts transitively on the set of vertices at distance m, for any m. Let fi be the uniform 
probability measure on the generators, and let [i* n denote the n-fold convolution; 
i.e. the probability measure associated to a random walk of length n. Then fi* n 
is a weighted sum of uniform measures on the sets F m for m < n. The generating 
function for the weights can be determined explicitly (see e.g. [47] Lem. 1.24), and 
a straightforward calculation shows that for any e, all but 0(C 1 ™ ) of the mass 
of fi* n is concentrated on the set of F m with m/n £ [(k — l)/k — e, (k — l)/fc + e]. 
We therefore we obtain the following proposition: 

Proposition 5.4. Let F be a free group of rank k. Then for any e > ; there are 
constants C\ > 1, C2 > and 5 > so that if g is obtained by random walk on F 
(in a free generating set) of length n, with probability 1 — 0(C 1 n ) the following 
holds: 

(1) there is some h with \h\s < 0(n 1 / 2+£ ); and 

(2) the product gh^ 1 is in [F,F]; and 

(3) there is an estimate scl^/i -1 ) < ((k — l)log(2fc — 1) + e)/6k ■ n/logn. 
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Remarkably, from this elementary estimate, we obtain a universal sharp upper 
bound on scl for random walks in arbitrary finitely generated groups. 

Theorem 5.5 (Universal upper bound). Let G be a group with a finite symmetric 
generating set S, and let \S\ — 2k. Let g be obtained by random walk on G (with 
respect to S) of length n (even), conditioned to lie in [G,G]. Then for any e > 
there are constants C\ > 1, Ci > so that with probability 1 — 0(C 1 n ) there is 
an inequality 

scl(s) < ((fc-l)log(2fc-l) + e)/6fc-n/logn 

Proof. Let <p : G take a free symmetric generating set for Fk to S. Then 

random walk in Fk (with respect to the standard generating set) pushes forward 
to random walk in G with respect to S. Since scl is monotone nonincreasing under 
homomorphisms, the theorem follows from Proposition 5.4 and Lemma 4.8, together 
with the fact that a random walk of length n has probability at least 0(7i~ fc / 2 ) of 
being homologically trivial (for n even) . □ 

5.3. Template for obtaining lower bounds. We will obtain lower bounds, com- 
plementing Theorem 5.5, for random walks in hyperbolic groups, and certain groups 
acting on hyperbolic spaces; the most important example of the latter will be 
(not necessarily quasiconvex) finitely generated subgroups of hyperbolic groups. 
The lower bounds are obtained from the counting quasimorphism construction, 
described in § 4.2; however, the argument is complicated by the fact that a ran- 
dom walk in a hyperbolic group (or in a hyperbolic graph) is almost certainly not 
quasigeodesic. 

The abstract template for obtaining lower bounds is the following somewhat 
technical proposition, which is basically just a restatement of some properties of 
small counting quasimorphisms. 

Proposition 5.6. Let G be a group acting by isometries on a S-hyperbolic simplicial 
complex Y (hot assumed to be locally finite) with a basepoint p. Let g e G be given, 
and let j(g) (or"/ for short) be a geodesic in Y fromp to g(p). We set n — length^). 
Fix C\ > 0, and let % be the successive (nonoverlapping) subpaths 0/7 of length 
Cilogn. Let K{5) (or K for short) be such that any (2, 4) -quasigeodesic in a 5- 
hyperbolic space stays within distance K of a genuine geodesic. Suppose that there 
is an e > for which the following inequality holds: 

#{i : ji K anti-aligns some 7' C 7} < n 1-e 

Then there are constants C 2 > and C3 depending only on C\, e and 5, and C4 
depending only on the action, and a homogeneous quasimorphism (f> on G satisfying 
the following properties: 

(1) <j>(g) >nC 2 / log n; 

(2) D{<f>) < C 3 ; 

(3) \4>{h)\ < 2\h\ s C 4 /\ogn for any heG. 

Proof. Let K be such that any (2, 4)-quasigeodesic in a <5-hyperbolic space stays 
within distance K of a genuine geodesic. Choose K with this property, and let E 
be the set of ji that do not K anti-align some 7' C 7. Let cs denote the small 
counting function associated to the set E, and similarly c s -i. Then by Lemma 4.3 
and the definition of S we have Cx(g) > nCij\o%n and c s -i (g) = 0. Then let be 
the homogenization of h^. 
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To obtain the last bullet point, observe that d(p, h(pj) < C§\h\s for any h e G, 
for some constant C5. □ 

Obtaining lower bounds on scl for random walks thus reduces to showing that 
for certain G, the condition in Proposition 5.6 holds with high probability for g the 
result of a random walk of length n/L (where L is the drift). The main technical 
issue is to relate subwalks of a random walk with subpaths of the geodesic joining 
the endpoints. We address this point in what follows. 

5.4. Nondegenerate random walks. We fix a group G and probability measure 
fi supported on a finite subset S of G so that S = S 1-1 , and /j,(s) = m( s_1 ) fo r eacn 
seS (such a measure is said to be symmetric). We also fix a simplicial action of 
G on a (5-hypcrbolic simplicial complex Y (not assumed to be locally finite) with a 
basepoint p. 

The subset S generates some subgroup (S) of G; we say [i is nonelementary if 
(S) does not fix any finite subset of dooY. 

A random sequence id = go,gi,g2, ■ ■ ■ (finite or infinite) is a [i-random walk in 
G if the successive differences Sj := g^\gi are independent random elements of S 
each with the distribution /i. For such a walk, define the associated fj,-random walk 
in Y, namely the sequence p = Po,Pi,P2 • • • where Pi = gip. Since S is finite, the 
set of distances dy{Pi,Pi+i) is uniformly bounded; we say such a walk has bounded 
increments. 

For any measure /j, of finite first moment on any group acting isometrically on 
any metric space, Kingman's subadditive ergodic theorem implies that there is a 
constant L > called the drift so that L = lim„_>. 00 d(p,p n )/n almost surely. 

Recall that for any points p, q in a hyperbolic space Y and any constant K, the 
shadow S p (q, K) is defined to be the set of all points r e Y so that every geodesic 
from p to r comes within distance K of q. This maybe expressed equivalently (up 
to slightly adjusting the constant R) in terms of the "Gromov product" (x ■ y) p := 
l/2(dy(p, x) + dy(p, y) — dy(x, y)) as the set of points with (q ■ r) p > dy{p, q) — R- 

We will make the following assumptions about G,Y,fi. 

Definition 5.7. Let G be a group acting simplicially on a i5-hyperbolic simplicial 
complex Y, and let \i be a symmetric probability measure with support equal to 
some set S C G. A triple (G, /U, Y) as above is nondegenerate if it satisfies the 
following conditions: 

(1) (nonelementary:) (S) does not fix any finite subset of dooY; 

(2) (positive drift:) the drift L is positive; 

(3) (acylindricity:) for any C\ > there are constants G2, G3 so that if q, r 
are points in Y with dy(q, r) > G2, there are at most G3 elements g E G 
with dy{q,gq) < C\ and dy(r,gr) < C\\ 

(4) (linear progress:) there are constants C\ > 1, G2 > so that 

P(d Y (p ,p n ) G [LC^n.LCm]) > 1 - e- nC * 

(5) (exponential decay:) there are constants C\ and G2 > such that for 
any y eF and any K, the probability that the result of a random walk of 
length n lies in the shadow S Po (y, K) decays exponentially in the distance 
to the shadow; i.e. 

P( Pn e S P0 (y,K)) < Cl e- C ^ d ^°^-V 
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Remark 5.8. We do not claim that every condition in this list is logically necessary; 
rather it reflects the ingredients that go into our proof of Theorem 5.26. It is natural 
to wonder whether the condition of acylindricity could be replaced by Bestvina- 
Fujiwara's weakly properly discontinuous condition (see [4], p. 76), since the latter 
condition is known to hold for a wider class of group actions; but our arguments 
do not seem to easily allow it. 

The following proposition is largely obtained by assembling known results: 

Proposition 5.9. In each of the following cases, (G,fi,Y) is nondegenerate: 

(1) G is a hyperbolic group, Y is the Cayley graph of G with respect to some 
finite generating set, and (S) is nonelementary. 

(2) G is a (strongly) relatively hyperbolic group, Y is the Groves-Manning space 
associated to G, and (S) is nonelementary. 

(3) For some surface E 7 the group G is the mapping class group, Y is the 
complex of curves, and (S) is not reducible or virtually abelian. 

Proof. The nonelementary axiom follows by hypothesis in all three cases. Exponen- 
tial decay in case (3) follows from Maher [39], and again the arguments go through 
verbatim in the setting of a non-elementary action of a group on a proper Gromov 
hyperbolic space, so cases (1) and (2) also follow from this, though presumably the 
result is standard in case (1). We now verify the other properties. 

In case (1), the Cayley graph Y is a proper Gromov hyperbolic space, and acylin- 
dricity follows from the fact that the action of G on itself is properly discontinuous. 
Positive drift holds in cases (1) by Kaimanovich [32], Thm. 7.3, and linear progress 
follows from Kcstcn's estimate [47], Lem. 8.1b for a random walk on a nonamenable 
group. 

In case (2), given a (strongly) relatively hyperbolic group G, Groves and Man- 
ning [27] construct a proper Gromov hyperbolic space Y (called the cusped space 
— see [27] § 3) on which G acts by isometries properly discontinuously, but not co- 
compactly; and acylindricality follows from the fact that the space Y is proper, and 
the action of G on Y is properly discontinuous. Positive drift holds by Kaimanovich 
[32], Thm. 7.3, and linear progress follows from Maher [39]. Although the results 
of [39] are stated in terms of the action of the mapping class group on the complex 
of curves, the results hold in the (simpler) case of an action of a non-elementary 
group on a proper Gromov hyperbolic space Y. 

In case (3), the complex of curves is a locally infinite Gromov hyperbolic simpli- 
cial complex on which the mapping glass group acts discontinuously by simplicial 
isometries, and acylindricity is a theorem of Bowditch [6], Thm. 1.3. Positive drift 
and linear progress follow from Maher [37], Thm. 1.1. □ 

5.5. Proximal points and unfolded walks. Since S is finite, the length of suc- 
cessive steps d(pi,pi+i) is uniformly bounded by a constant, and therefore we can 
think of the random walk as a (coarse) path in Y of length < C\n. We would like 
to use this estimate to show that with very high probability, "most" of the Pi are 
within a bounded distance of the geodesic from po to p n . Actually, it turns out to 
be easier (and just as useful) to show that most points on the geodesic from p to 
p n are within a bounded distance of some pi, and moreover this fact can be de- 
duced directly from linear progress (see Definition 5.7) and elementary hyperbolic 
geometry. 
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Definition 5.10. Let po,Pi, ■ ■ ■ ,Pn be a walk on a (5-hypcrbolic space Y. If for all 
i there is an inequality d(pi,pi+i) < C\ we say p, has Ci-bounded increments. 

Definition 5.11. Let Po,Pi, • • • ,Pn be a walk on a <5-hyperbolic space Y, and let 
7 be a geodesic from p to p n . A point q £ 7 is K -proximal if dy(q,pi) < ^ for 
some We denote the JT-proximal subset of 7 by 7^-. 

Lemma 5.12. Le£ Po>Pi>""" iPn &e a wa/fc on a 6 -hyperbolic space Y with C\- 
bounded increments. Let 7 be a geodesic from pq to p n . Suppose length^) > C^n. 
Then for any e > there is a constant K(e,Ci,C 2 ) so that if "fx denotes the 
K -proximal subset of 7, there is an estimate 

lcngth(7^)/lcngth(7) > 1 — e 

Proof. We assume K ^> C >• 5 for convenience. 

There is a nearest point projection n from the Pi to 7 so that d(n(pi), 7r(p i+1 )) < 
C + 0(5) and consequently every point in 7 is within distance C/2 + 0(5) from 
some n(pi). If p e 7 is not if-proximal, then d(p i7 -f) > K — 0(C) and therefore 
d(Pj,l) > 3K/4-0(C) for < K/AC. On the other hand, d( Pi , Pj ) < \j-i\C 

so any geodesic from pj to Pi does not come within distance K/2 of 7. By (5-thinness, 
we can conclude that d(n(pj), ir(pi)) = 0(6) for \j — i\ < K/AC. 

If the set of non X-proximal points has length at least e • length^), there 
are at least e • length(7)/0(<5) such points whose mutual pairwise distances is 
at least 0(5). To each such point we can associate a sequence of K/2C points 
Pi whose projections to 7 are within 0(5) of it, and therefore these collections 
of points are disjoint. The total number of pi in these collections is at least 
(K/2C)eC- 1 n/0(5) = nKeC- 2 /0(5) so if K > e- 1 C 2 0(5) we get a contradic- 
tion, as desired. □ 

From the linear progress axiom, we deduce the following: 

Lemma 5.13 (proximal). Let (G,^,Y) be nondegenerate. Let p ,--- ,p n be a 
random walk, and 7 a geodesic from p to p n . There are constants C\,Ci > so 
that for any e > there is a further constant K(C\,e) so that 

P ((lcngth( 7 ) > O) A (length(7^)/length(7) > 1 - e)) > 1 - e^ 2 

Note that the first condition implies the second by Lemma 5.12, but it is conve- 
nient to state both conditions explicitly 

Definition 5.14. We say 7 as above is C\-long (or just long if C\ is understood) 
if lcngth( 7 ) > Cm. 

In the sequel we use the convention that oriented geodesies are parameterized 
proportional to arclcngth. We also use the convention that 7 is oriented from p to 
p n , so we write 7(0) = p and 7(1) = p n . 

Now, by definition, for every s,t with 7(5), 7(f) G there are indices i,j so 
that dy(Pi, , y(s)) < K and dy(Pj, 7(t)) < K. It is not necessarily true, however, 
that s < t implies i < j. Nevertheless, this should be true whenever dy('y(s),'y(i)) 
is sufficiently large, with big probability. We quantify this. 

Definition 5.15. Fix some big constant M 3> 0. We say a walk p , ■ ■ ■ ,p n is 

(K, M log n)-unfolded (or just unfolded if K and M are understood) if for every j > i 
with j — i > M logn and every geodesic 7$ from p to p i: we have dy(pj,-fi) > K. 
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Lemma 5.16 (unfolded). Let [G,ji,Y) be nondegenerate. Let po,--- ,Pn be a 
random walk. For any K there is C\ > so that 

P(walk is (K, M log n) -unfolded) > 1 - n 2 - ClM 

Proof. Pick indices i,j so that j > i and j — i > Mlogn. By the linear progress 
property, the walk Pi,Pi + \, ■ ■ ■ ,pj satisfies dy{pi,Pj) > Ci Mlogn with probability 
1 — n~ c ' 2M for suitable C\, C2,. The subset of the JT-neighborhood of 7$ which lies 
outside of B(pi, C\ Mlogn) is contained in a shadow S Pi (y, K + C3), where y is the 
point distance Ci Mlogn from pi along 7^, and C3 only depends on 6, the constant 
of hyperbolicity. By linear progress and the the exponential decay property for 
shadows, the probability that Pj is within distance K of 7^ is at most n~ CiM , for 
some constant C4. Hence 

P ((d Y (pi,Pj) > CiMlogn) A (rfyfe-,7,') > K)) > 1 - n' '^ 
There are fewer than n 2 indices i, j as above; the lemma follows. □ 

Remark 5.17. The condition that a walk should be (K, M log n)-unfolded is very 
strong, probably much stronger than we need. But it does simplify the proofs to 
come. If we just insist that most pairs of indices i,j with j — i > Mlogn are 
"unfolded" in the obvious sense, then the probability will be at least 1 — e~ nCl , for 
sufficiently big (but fixed) M. We do not use this fact in the sequel. 

5.6. Anti-aligned segments and matching. Let Po,p\,-" ,Pn as above be a 
walk in Y with bounded increments, and 7 the geodesic from to n. We assume 7 
is long, and let N — length^). We fix some constants M » and K >• (to be 
determined later). 

Definition 5.18. An R-match (or just a match if R is understood) is a triple 
(a, (3, h) where a, f3 are geodesic segments of 7 of length R, and h € G so that ha 
anti-aligns /?; i.e. d(ha(0), /3(1)) < K and d(ha(l), /3(0)) < K. 

Ultimately we will be concerned with i?-matches where R = Mlogn. It is 
convenient for our i?-matches not to be too close to each other; that can be achieved 
by the following "cut-in-half" lemma: 

Lemma 5.19 (cut in half). Let (a,(3,h) be an R-match, and let jk C 7 so that 
length(7^-na:) > (1— e)R and length(7^n/3) > (1— e)R. Then there are subsegments 
a' C a and p' C /3 with endpoints in jk H a and H /3 respectively, so that 
(a',/3',h) is anR'-match and d Y (a' , /?') > R/2, for some R'/R e [l/4-4e, l/4+4e]. 
Furthermore length(7x H a') > (1 — 5e)i?' anrf similarly for (3' . 

Proof. If ^ is an oriented geodesic, let ^~ denote the initial half of v, and v + the 
terminal half. One of the pairs (a + ,/3~) and (a~,/3 + ) must be disjoint, and there- 
fore one of the pairs (a ++ ,/3~~), • • • , (a~~,/3 ++ ) must have segments separated 
from each other by distance at least R/2. Each of these segments has length i?/4, 
so the length of their intersections with are both at least R/A — eR, and there 
must be further subsets in each of length at least R/4 — 2eR matched by h. Let 
a',f3' be maximal subsegments with endpoints in these subsets. □ 

Definition 5.20. Let 7 be a geodesic in Y of length N. Let en be successive 
subpaths of 7 of length MlogiV, so that there are N/M log N of them. We say 7 
is well-matched if for every subset L of indices with |/| > N /lOMlogiV there are at 
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least 9|/|/10 disjoint geodesies fjj in 7 and elements hj € G and indices € / so 
that (aj(j),/3j, hj) is a match. 

Note that the definition of a match and well-matched implicitly depend on K and 
M. If we need to specify them we use the terminology (K, M log N) well-matched. 

Lemma 5.21. Let G act simplicially on a 5-hyperbolic simplicial space Y. Let p 
be a basepoint, and let 7 be a geodesic from p to g(p) for some jeG. Then there 
are constants K and C\ depending only on S, so that if '7 is not (K, M log N) well- 
matched, there is a counting quasimorphism (f> on G, supported on words of length 
MlogN, with D{4>) < Ci and 0(g) > JV/(100MlogiV). 

Proof. Choose a set of indices / as above witnessing 7's failure to be K well- 
matched. Then no realizing path for g can contain more than 9|/|/10 disjoint anti- 
aligned copies of K. Define cj to be the counting function which counts disjoint 
copies of any of the ai with i e / in a realizing path. We can take 7 itself as a 
realizing path for cj, so that cj(g) > On the other hand, any realizing path for 
q^ 1 for c/ contains at most 9|/|/10 disjoint copies of aj, so <j)(g) := ci(g) — ci(g~ 1 ) > 
\I\/10. □ 

Remark 5.22. We have implicitly used the fact that disjoint copies of a^ 1 on any 
realizing path for g -1 are close to geodesic segments of 7 which overlap only in 
segments of length < 2K + 0(6); trimming these overlaps, we can assume the 
nearby segments in 7 are disjoint. 

It remains to understand when 7 associated to a random walk is (K, M log N) 
well-matched, for suitable constants K, M. 

The next lemma shows that any well-matching can be "relativized" to any subset 
jk with length(7/f)/length(7) > 1 — e. 

Lemma 5.23. Let 7 be a geodesic of length N, and let (3j a collection of at least 
97V/(100Mlog N) disjoint subpaths each of length MlogN. Let Jk be a subset of 
7 with length(7x)/length(7) > 1 — e. Then at least (1 — y/e) of the (3j satisfy 

length^ n f3j) /length(pj) > 1 - 12y/i 

Proof. This is essentially just Chebyshev's inequality. The total length of the f3j is 
at least 9iV/100. For each f3j which fails to satisfy the desired inequality, length(7^n 
(3j) > 12-^/e • length(/3j) and therefore 

eN > length( 7 g) > 12^97V/100 

where S is the proportion of "failing" (3j . We conclude S < y/e as claimed. □ 

Combining Lemma 5.23 with Lemma 5.19 we deduce the following: 

Proposition 5.24 (relative well- matching) . Let 7 be a geodesic in Y of length 
N. Suppose 7 is (K, MlogN) well-matched, and let Jk be a subset of 7 with 
lcngth(7i<-)/length(7) > 1 — e. 

If e is sufficiently small, there are at least 8N/(lOMlogN) segments ctj in 7, 
and 8N/(10MlogN) disjoint segments (3j in -f, each of length at least M log N/5, 
so that for each {3j there is a match (otj,(3j,hj) satisfying 

(1) the endpoints of fij are in 7^- and similarly for a^jy, 

(2) there is an inequality lcngth(7if n/3j)/length(/3j) > 1 — 100y/e and similarly 
for Q!j(j); and 
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(3) d Y ((3 3 ,a lU) )>M log N/2. 

Proof. Of all the successive segments of 7 of length M log TV, let /' be the 
set of indices for which length(7x H aj)/length(aj) > 1 — 12-^/e. Then |7'| > 
(1 — \/e)N/ (M log TV) by Lemma 5.23. Since 7 is well-matched, we can find at least 
9|/'|/10 disjoint geodesies j3j in 7 and matches (a^j) , /3j , hj) with G Apply- 
ing Lemma 5.23 again, there is a subset I C V with |/| > (1 - \ft) 2 N / (M log N) 
so that further length^ n /3j)/length(/3j) > 1 - 12-y/e. 

For each choose some di(j) as above, and relabel it as ctj. Note that although 
the j3j are distinct for different j, we do not assume the aj are distinct for different 
j (for all we know, they might all be the same!). Applying Lemma 5.19 to each 
(aj,(3j,hj) gives rise to new (a'j,f3'j,hj) of at least 1/4— lOO-^/e the length, satisfying 
the desired properties. Notice now that even if on = aj we might have a\ ^ ct'j. □ 

5.7. Uniform lower bounds for hyperbolic spaces. We arc now ready to re- 
turn to probability. 

Lemma 5.25. Let (G,fi,Y) be nondegenerate, and letpo, ■■■ ,p n be a random walk 
of length n. Fix C\ and K. Then there is a constant G2 so that for any M the 
following holds. Consider the collection of indices a<a'<b<c<c'<d for 
which there are geodesies a from p a to pi, and (3 from p c to pd with the following 
properties: 

(1) length(a) > MC\ logn and similarly for (3; 

(2) there is t G [0.1,0.2] so that d Y {p a >,a(l -t))<K and d Y (p c ',P(t)) < K; 

(3) there is some h e G so that d Y {ha(0), /3(1)) < K and d Y (ha(l), /3(0)) < K. 
The probability that this collection of indices is nonempty is at most n 6 ~ c ' 2M . 

Proof. Fix a collection of indices a < a' < b < c < c' < d. The key point is that we 
don't care about a and (3 per se, but only on their G-orbits Ga and G/3, since (3 
is only compared with a translate ha. Speaking somewhat loosely, since b < c, the 
random variables Ga and G/3 are independent. In fact, if we let f3" be a geodesic 
from p c to p c i , and f3' a geodesic from p d to pd then the orbit Gf3' is independent 
of Ga and G/3". 

By acylindricity (this is the only place in the entire argument that acylindricity 
is used), there are only boundedly many h e G with d(ha(l), (3(0)) < K and 
d(ha(l — t),/3(t)) < 3K, providing n is bigger than some universal constant. So 
we can suppose that we are given a fixed geodesic a' of length > (1 — t)MC\ logn 
starting within distance 3-fT of p c > , and we want to estimate the probability that a 
random walk from p c > to pd ends up within distance K of the endpoint ha(0). This 
can be bounded from above by the probability that a random walk started at p c > 
lies in the finite collection of shadows S Pc , (ha(0), K + G2), for some constant G2 
which only depends on S, the constant of hyperbolicity. By the exponential decay 
property for shadows, this probability is at most n~ C3M for some G3. 

Summing over all possible choices of indices gives probability at most n 6 ~ C3M . 

□ 

We deduce our main theorem on nondegenerate random walks on hyperbolic 
spaces. 

Theorem 5.26 (Hyperbolic lower bound). Let Y be a S-hyperbolic simplicial com- 
plex (not assumed to be locally finite), and let G be a finitely generated group that 
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acts simplicially on Y . Let ji be a symmetric probability measure of finite support 
on G so that (G, \i, Y) is nondegenerate, in the sense of Definition 5. 7. 

Let g be obtained by random walk on G (with respect to fi) of length n. Then for 
any C\ there is a C 2 > and C3 depending only on 5 and G, so that with probability 
at least 1 — n~ Cl there is a homogeneous quasimorphism <p on G satisfying the 
following properties: 

(1) 0(g) >nC 2 / log n; 

(2) D(<j>) < C 3 ; 

(3) \<f>(h)\ < 2dy(q, hq)Cs/ logn for any g e G and any q^Y. 

In particular, for any C\ > there is a constant C > 1 so that if we condition on 
g € [G, G] (for n even), then 

P(C _1 n/logn < scl(s) < Cn/logn) > 1 - n~ Cl 

Proof. We use the notation K, M as above. 

Let po, ■ ■ ■ ,p n be the associated walk from the basepoint p to gp, and let 7 be 
the geodesic from p to gp. We assume that 7 is long (see Definition 5.14), that 
the walk is unfolded (see Definition 5.15), and that the conclusions of Lemma 5.25 
hold. By Lemma 5.13, Lemma 5.16 and Lemma 5.25, the probability of all three of 
these things happening is at least 1 — rfi- c ^ M for some Ci, where M is as above. 

Since 7 is long, length^) = Cn for some constant C bounded above and be- 
low. If 7 is not (K, M log (Cn)) well-matched, the desired conclusion follows from 
Lemma 5.21, so we suppose 7 is well-matched. 

By Proposition 5.24 we can find a pair of segments (in fact, many such pairs) a, 
f3 of length Mlog(Cn)/5, with endpoints in with length(7x H a)/length(a) > 
1 — 100 y/e (and similarly for f3), and which are distance at least > Mlog(Cn)/2 
apart in 7. Pick t G [0.1,0.2] with a(l — t) and /3(t) in 7^. Since 7 is unfolded, 
the points a(0), a(l — t), a(l), (3(0), (3(t), (3(1) are within distance K of points p a , 
Pa', Pb, Pc, Pc', Pd on the walk so that the indices appear in the same order that 
the corresponding points appear in 7. 

But the conclusion of Lemma 5.25 says that no such pairs a, (3 can exist, so we 
get a contradiction. It follows that 7 is not well-matched after all, and the desired 
quasimorphism exists. □ 

From Proposition 5.9 we obtain the following corollaries: 

Corollary 5.27. Let G be either a hyperbolic group (resp. the mapping class group 
of a surface S ), and let /x a symmetric probability measure on G with finite support 
generating a nonelementary subgroup (resp. a subgroup which is not reducible or 
virtually abelian). Then for any C\ there is C so that if g is obtained by random 
walk in G of length n (even) and conditioned to lie in [G,G] we have 

P(C _1 n/logn < scl(cj) < Cn/ log n) > 1 - n~ Cl 

Since reducible elements in mapping class groups fix points in the complex of 
curves, we obtain: 

Corollary 5.28. Let p be a symmetric probability measure on the mapping class 
group of S of finite support, and suppose the subgroup it generates is not reducible or 
virtually abelian. Then for any C\ there is C so that if g is obtained by random walk 
in G of length n, then g cannot be expressed as a product of fewer than Cn/ log n 
reducible elements, with probability at least 1 — n~ Cl . 
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From monotonicity of scl under homomorphisms, we obtain: 

Corollary 5.29. Let G be a group and pL a symmetric probability measure with 
finite support which generates G. Suppose that for any e > there is a 5 > so 
that if g is obtained by random walk on G of length n (even) conditioned to lie in 
[G, G], we have 

P(scl(g) < e • n/logn) > S 

Then every homomorphism from G to a hyperbolic group or to a mapping class 
group has virtually abelian image. 

Proof. This just depends on the observation that reducible subgroups of mapping 
class groups are themselves mapping class groups of simpler surfaces. □ 

5.8. Geometry of the scl norm of a random subspace. As remarked earlier, 
stable commutator length is not merely a function, but actually defines a pseudo- 
norm on the space By(G) of (real) group 1-boundaries; i.e. formal real linear 
combinations J2^i9i wrtn U £ K. and gi € G, representing in Hy(G;M.). In 
fact, scl descends to a pseudo-norm on the "homogenized" quotient By (G) := 
Bi{G)/{g-hgh- 1 ,g n - ng) — see [10] § 2.6 for details. 

A nice corollary of Theorem 5.26 is to obtain a priori geometric control over the 
geometry of a random subspace of By '(G) of fixed dimension. 

Theorem 5.30 (Random norm ball theorem). Let Y be a S-hyperbolic simplicial 
complex (not assumed to be locally finite), and let G be a finitely generated group 
that acts simplicially on Y. Let \i be a symmetric probability measure of finite 
support on G so that (G, /x, Y) is nondegenerate, in the sense of Definition 5. 7. 

Fix k and let g\, g 2 , ■ ■ ■ ,g^ be obtained by random walk on G (with respect to n) 
of (even) lengths t\n,l-2,n, ■ ■ ■ ,£kn, all conditioned to lie in [G, G]. Then for any 
C\ there is a C2 > and G3 depending only on S and G, so that with probability at 
least 1 — n 1 , for any formal sum ^2Ugi there is an estimate 

P(G- 1 (^i^n)/logn < scl(^t lft ) < C^t^n)/ log n) > 1 - n~ Cl 

i i 

Proof. We give the argument in the case k = 2 and ly = £ 2 = 1. The general case 
follows by a minor modification of the argument. 

We know that scl(<Ji) and scl (32) are between G _1 n/logn and Gn/logn with 
high probability. By the definition of a norm, scl(#i +52) < G2n/logn. By 
the argument of Theorem 5.26, again with high probability we can find subsets 
Li (i = 1,2) of the indices associated to the quasimorphisms <pi certifying the 
lower bound for scl(gi) so that > (1 — e) and such that if <j>i t 2 is the small 

counting quasimorphism associated to the union of the Ii segments, then 4>i,2(9i) > 
(I — e)<pi(gi). Note that D((j}\^) < G3 for the same constant as in Theorem 5.26, 
since this constant is universal for any small counting quasimorphism. Moreover, 
0i,2(*i5i +hg2) = Mi, 2(51) +£2^1,2(52) > (ii + t 2 )G _1 n/logn. This gives the 
desired estimate in the orthant where the U are both positive. In an orthant in 
which ti is negative, use g^ 1 in place of gi together with the observation that 
U9i = -Ugl 1 in B? . □ 

5.9. Concentration versus compression. It is natural, in view of Theorem 5.1 
to believe that the estimates in Theorem 5.26 and Theorem 5.30 can be sharpened. 
We make the following conjecture: 
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Conjecture 5.31 (Concentration conjecture). Let (G,fi,Y) be nondegenerate in 
the sense of Definition 5.7. Then there is a constant C > and C\ > so that for 
any e> 0, if g is obtained by random walk on G of length n (even), conditioned to 
lie in [G, G], then 

P(|scl(.g) log n/n - C\ < e) > 1 - e~" Cl 

Conjecture 5.31 would imply the following conjecture about the geometry of the 
norm in a random subspace: 

Conjecture 5.32 (Norm conjecture). Let (G,fi,Y) be nondegenerate in the sense 
of Definition 5.7. Then there is a constant C > and C\ > so that for any finite 
integer k and any e > 0, if gi, ■ ■ ■ ,gt are obtained by independent random walks on 
G of length l\n, • • • , t k n (even), conditioned to lie in [G, G], then 

Pflscl^i^iogn/n-C^i^l < e) > 1 - e"" Cl 

It is tempting to conjecture further that C = A/6 where A is the entropy of fi, 
but this might be premature without first understanding a wider range of examples. 

5.10. Random walks on Out(F n ). In this section we summarize our current 
knowledge of the behavior of random walks on Out(_F„). In this context, it is natural 
to consider random walks on the isometry group of a non-locally compact Gromov 
hyperbolic space. In this section we show that if [i is a probability distribution with 
finite support on the isometry group of a non-locally compact Gromov hyperbolic 
space Y, whose support generates a discrete non-elcmcntary subgroup, then all of 
the axioms from Definition 5.7 are satisfied, except possibly for acylindricality. A 
key step is to show that random walks converge to the boundary. 

Theorem 5.33. Let [i be a -probability measure with finite support on the isometry 
group of a (not-necessarily proper) Gromov hyperbolic space Y with basepoint yo, 
whose support generates a discrete non- elementary subgroup. Then almost every 
sample path {g n yo} converges to the Gromov boundary dY , and the hitting measure 
v is non-atomic, and is the unique ^-stationary measure on dY . 

The arguments from [37] and [39] then go through in this case, to give linear 
progress, positive drift and exponential decay. We can then show: 

Theorem 5.34 (Linear translation length). Let G be a group of isometries of a 
Gromov hyperbolic space Y, which is not necessarily locally compact. Let fi be a 
probability distribution with finite support on G, such that the support of \x generates 
a non- elementary subgroup of G. Then there are constants L > and c < 1 such 
that 

P(tK) < Ln) < 0(c n ), 

where w n is the group element obtained by a random walk of length n, and r(w n ) 
is the translation length of w n acting on Y. 

The group Out(_F n ) acts on a (locally infinite) simplicial complex, the complex 
of free factors, which Bestvina and Feighn [3] have shown is Gromov hyperbolic, so 
the results above apply in this case. If the action of Out(F„) on the complex of free 
factors is acylindrical, then this would imply that the growth rate of scl for random 
walks in Out(F„) is n/logn. However, the acylindricality of the action of Out(F„) 
is still unknown. If an element of Out(F n ) acts on the complex of free factors with 



32 



DANNY CALEGARI AND JOSEPH MAHER 



positive translation length, then it is fully irreducible, and so this generalizes results 
of Rivin [42] and Kowalski [35]. Finally, we remark that Out(F n ) also acts on the 
complex of free splittings, which Handel and Mosher [30] have shown is Gromov 
hyperbolic, but in this case the action is definitely not acylindrical. 

We now prove Theorem 5.33. Kaimanovich [32] showed that a random walk on 
a non-elementary subgroup of isometries of a proper Gromov hyperbolic space Y 
converges to the boundary dY almost surely. If Y is proper, then YUdY is compact. 
However, the only place where this is used is to show that there is a weak limit of 
the convolution measures [i* n which is a probability measure on YiidY. Our initial 
task is therefore to show that for a random walk on a non-elementary subgroup of 
isometries of a non-proper Gromov hyperbolic space, the convolution measures n* n 
converge to a ^-invariant probability measure supported on the Gromov boundary 
8Y. 

Lemma 5.35. Let G be the isometry group of a (not necessarily proper) Gromov 
hyperbolic simplicial complex Y, and let p be a probability distribution with finite 
support which generates a non- elementary subgroup of G. Then there is a probability 
distribution v on the Gromov boundary dY which is a weak limit of the convolution 
measures \i*' a . 

As the Gromov boundary is not compact, it will be convenient to consider an 
alternative compactification, namely the horofunction compactification, which we 
now describe. Let Y be a Gromov hyperbolic simplicial space, which is not neces- 
sarily locally compact, and let C(Y,M) be the space of continuous functions on Y, 
with the compact-open topology, which is in this case is equivalent to the topology 
of uniform convergence on compact sets. As K is Hausdorff, C(Y,M) is also Haus- 
dorff. Let yo e Y be a basepoint. There is a map from Y to C(Y,R), defined by 
sending y to the corresponding horofunction, h y (z) = d(z,y) — d(yo,y). Let Yh be 
the closure of Y in C(Y,K), which is sequentially compact, and hence compact as 
C(Y,M) is Hausdorff. The space Yh is known as the horofunction compactification 
of Y, and Yh \ Y is called the horofunction boundary. 

Lemma 5.36. There is a coarsely well-defined G-equivariant map li^YU dY, 
where dY is the Gromov boundary ofY. 

Proof. We start with some preliminary observations about functions h = h y corre- 
sponding to points y e Y, i.e. h y (z) = d(z,y) — d(yo,y). In this case h y achieves 
its minimum value of —d(y ,y) at the point y. Furthermore, for any geodesic 7, 
the restriction of h y to 7 has a coarsely well defined minimum a bounded dis- 
tance away from the closest point projection p of y to 7. The value of h y at p 
is equal to d(p,y) — d(yo,y), up to bounded error depending only on the constant 
of hyperbolicity S, and for any other point 967, the value of h y (q) is equal to 
d(p, q) + d(p, y) — d(yo, y), up to bounded error depending only on 5, i.e. 

(5.10.1) h y (q) -K< d{ P , q) + d(p, y) - d(y , y) < h y (q) + K, 

for some constant K, depending only on 5. 

Given a function h e Yh, consider inf(/i) = inf ye Y h(y), which takes values in 
[—00, 0]. If a horofunction corresponds to a point in Y, i.e h = h y for some i/eF, 
then mi(h y ) = —d(yo,y). We now consider the two cases depending on whether 
inf(/i) > —00, or inf(/i) = —00. 
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If inf(ft) > — oo, then there is a point y such that h(y) < inf(ft) + 5, and we 
shall map ft to this point. We now show that this map is coarsely well defined. 
Let yi and y 2 be two points in Y with h(yi) < inf(ft) + e, and let 7 be a geodesic 
connecting them. Let y n be a sequence of points in Y such that the corresponding 
horofunctions h Vn converge to ft. As 7 is compact, for any number e > there is 
an N such that for all n> N, and for all points 1/S7, \h(y) — h Xn (y)\ < e. Let p n 
be the nearest point projection of y n to 7, then by equation (5.10.1), 

hy n (p n ) < m£(h) + 2e-d(yi,y 2 )/2 + K, 

where K depends only on S, and so d{y\,y 2 ) is bounded by a constant which only 
depends on S. 

If inf (ft) = —00, there is a sequence of points {y n } such that h(y n ) — > —00. We 
now show that this sequence {y n } converges to a point in the Gromov boundary, 
and we shall map ft to this point. Recall that a sequence {y n } converges to the 
boundary if for every number B there is a constant N such that the Gromov 
product (y m ■ y n )y > B f° r all to > TV and n > N. So if the sequence {y n } does 
not converge, then there is a constant B such that for all TV there are points in the 
sequence y n and y m with m> N and n> N such that (y m ■ y n )y < B. 

Let 7 be a geodesic from y n to y m , and let p be the closest point on 7 to the 
basepoint yo- The distance from p to yo is equal to the Gromov product, up to an 
error which only depends on 5, so d(yo,p) < B + K, where K only depends on S. 
In particular this implies that the value of h(p) is at bounded below by — B — K. 
However, the value of ft at any point on 7 is bounded above by the value of ft at 
the endpoints, up to an error which only depends on S, and so as h(y n ) — »■ —00, 
this implies that h(p) — » —00, which is a contradiction. Therefore the sequence y n 
converges to a point in the Gromov boundary. 

Finally, we observe that the property that a sequence y n converges to a particular 
point on the Gromov boundary does not depend on the choice of basepoint for the 
Gromov product, and so this map is G-equi variant. □ 

Consider the convolution measures fi* n on Y^. As Yh is compact, there is a weak 
limit v, which is a /^-stationary probability measure. The measure v pushes forward 
to a /i-stationary probability measure on Y U dY, which by abuse of notation we 
shall also refer to as v. We wish to show that v is supported on the Gromov 
boundary, and this follows as an application of the following lemma: 

Lemma 5.37 ([38], Lcm. 3.5). Consider a random walk generated by a symmetric 
finitely supported probability distribution jj, on the isometry group of a (not nec- 
essarily proper) Gromov hyperbolic space Y, such that the group G generated by 
the support of n is non- elementary. Let v be a fi-stationary probability measure on 
Y U dY, and let | • | G be any proper metric on G. 

Let X be a set with the property that there is a sequence {ki} ie ^ such that for 
any translate gX of X there is a sequence {u»j}j 6 N of elements ofG, such that the 
translates gX, wigX, w 2 gX, . . . are all disjoint, and \w n \ G < k n for all n. Then 
u{X)=Q. 

We shall choose X = B(y ,r), the ball of radius rinF. As G is non-elementary 
it contains a hyperbolic isometry / with translation length greater than r+25. Then 
for any translate gB(y , r), the translates f n gB(y , r) are all disjoint. Choose | • | G 
to be word length with respect to the generating set consisting of the support of /j,. 
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As I/™Ig — n l/lc we ma y choose k n — n |/| G , and then Lemma 5.37 implies that 
v(B(yo,r)) = 0. As this holds for every r, this implies that v(Y) — 0, as required. 
This completes the proof of Lemma 5.35, showing that there is weak limit of the 
convolution measures supported on the Gromov boundary. 

We now prove Theorem 5.33, convergence to the boundary, following the argu- 
ments of Kaimanovich [32] . The action of G on Y U dY satisfies the following two 
properties: 

(CP) If the sequence g n yo converges to a point in dY, then the sequence g n hyo 
converges to the same point, for any h € G. 

(CS) The boundary dY consists of at least three points, and there is a G- 
equivariant Borel map S assigning to pairs of distinct points b\ , b 2 in dY 
subsets (strips) S(bi,b 2 ) C Y, such that for any three pairwise distinct 
points bi e dY, i = 0,1,2, there are neighborhoods bo e Uo C Y, and 
b t e Ui C dY, i = 1, 2 with the property that S(bi,b 2 ) n U = for both 
h eUt,i = 1,2. 

We have shown that (dY, v) is a /x-boundary for G. Furthermore, the action of 
G on Y U dY satisfies properties (CP) and (CS) from Kaimanovich [32], and so all 
hypotheses of [32] Thm 2.4 are satisfied, except compactness. However, in the proof 
of Thm 2.4 compactness is only used to guarantee the existence of a /i-stationary 
probability measure, and we have shown how to construct such a measure above. 
Therefore the conclusion of Thm 2.4 holds in this setting, and this completes the 
proof of Theorem 5.33. 

As we have assumed that the probability distribution jj, has finite support, /i also 
has finite moments and finite entropy, so the arguments of [32] Section 6 also go 
through in this case, and so in fact (dY, v) is the Poisson boundary, though we do 
not need this property for our purposes. 

The properties of exponential decay and linear progress now follow from [37], as 
these arguments only use 5-hypcrbolicity and convergence to the boundary, which 
we have now established. We may now complete the proof of Theorem 5.34. 

Proof, (of Theorem 5.34) We briefly review some properties of quasigeodesics from 
[7] Section III.H, see also Fujiwara [21] Section 1.2. Let a be a path which is 
locally quasigeodesic, i.e. there are constants D,K and c such that every subpath 
of length at most D is a (K, c)-quasigeodesic. Then there is a constant Do, which 
depends only on the constant of hyperbolicity 8, and the quasigeodesic constants 
K and c, such that for all D > D n , the locally quasigeodesic path a is globally a 
(K' , c')-quasigeodesic, where K 1 and c' depend only on 6. A local quasigeodesic 
may be constructed by concatenating geodesic segments with bounded overlaps. 
More precisely, let a and /3 be two geodesies in Y such that the final point of a 
is equal to the initial point of /?, and let 7 be a geodesic from the initial point 
of a to the final point of f). We define the overlap 0(a,(3) of a and (3 to be the 
largest distance from any point on a U (3 to 7. There is a constant Q, which only 
depends on S, such that if a is a path consisting of a union of geodesies, each of 
length at least Q, such that each successive pair overlap by at most 25, then a is a 
quasigeodesic, with quasigeodesic constants depending only on S. 

Let 7 be a geodesic from j/o to gyo, and let 7 be 7 with the reverse orientation. 
Consider the path formed from the union of the geodesic segments g k -f. Let m be 
the midpoint of 7, and let a be a geodesic from m to gm, then the union of the 
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geodesic segments g k a also forms a path in Y. Let B be the overlap of 7 and gj. 
By thin triangles, there is a constant K, which only depends on 6, such that if the 
length of 7 is at least 2B + Q + K, then then length of a is at least Q, and so the 
union of the g k a forms a (K' , c')-quasigeodesic, where K' and c' only depend on 5. 
Furthermore, the distance between m and gm is equal to length^) — 2B, up to an 
additive error which depends only on <5, and this in turn is equal to the translation 
length of g, again up to an additive error which only depends on S. 

If the size of of the overlap between 7 and 57 is at least B, then 7 has an initial 
segment of length B which fellow travels with a final segment of g _1 7, which is 
the initial segment of 7 from y to g~ 1 yo- In particular, this means that g~ 1 yo € 
S Vo (gyo, B + K), for some constant K which only depends on 8. 

We now estimate the probability that this occurs for a random walk of length n. 
Recall that by exponential decay, there are constants K and ci < 1 such that for 
any g e G 



By linear progress, the probability that d(yo,w n y ) < Ln decays exponentially in 
n, which implies 



where C2 < 1 is the exponential decay constant from linear progress. Therefore the 
probability that the overlap of 7 and w n j is at most Ln/4 decays exponentially 
in n. As we have shown that the translation length of w n is equal to length^) — 
20(7,7), U P to additive error depending only on S, this implies that the probability 
that translation length of w n is at least Ln/2 tends to one exponentially fast, as 
required. □ 



In this section we shift our focus abruptly, and concentrate on obtaining uniform 
lower bounds on scl for random walks in arbitrary finitely generated groups. If G 
is any group, then Bavard duality implies that either scl vanishes identically on 
[G, G], or else there is a homogeneous quasimorphism </> on G. In the former case, 
there is nothing to say. In the latter case, one obtains uniform lower bounds on scl 
from an estimate on the distribution of values of (f>. 

It turns out that there is a central limit theorem for <f), proved by Bjorklund 
Hartnick [5] . This theorem allows us to obtain uniform lower bounds on scl of order 
0(^/n). An important special case of this theorem concerns quasimorphisms ob- 
tained from actions of groups on circles (we obtained this special case independently 
of Bjorklund- Hartnick, though the method of proof is similar). In the next section 
we prove the central limit theorem for circle actions, and derive some geometric 
applications, of independent interest. 

6.1. Groups acting on circles. Let Homeo + (S' 1 ) denote the group of orientation- 
preserving homeomorphisms of the circle. This group has a universal central exten- 
sion consisting of the group of homeomorphisms of R that commute with integer 
translation; we denote this group Homeo + (R) z . 




P^n^o € S yo (w n y 



' 4 



\Ln))<Kc} Ln + 0(<$), 



6. Universal lower bounds 
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Poincare defined a function rot : Homeo + (M) z — > R called rotation number by 
the formula 

mt(h) = lim ^ 

n— >oo Ji 

This descends to a function rot : Homeo + (5 1 ) — > R/Z. The function rot is a 
homogeneous quasimorphism on Homeo + (R) z with defect D(rot) — 1. 

Now let G be any group. Suppose G acts on the circle by orientation-preserving 
homeomorphisms; i.e. suppose we have G — > Homeo + (5 1 ). The preimagc of G 
in Homeo + (R) z is a (possibly split) central extension G of G and we obtain a 
homogeneous quasimorphism rot on G. The defect of rot on G is usually equal to 
1, but might be smaller, for instance if the centralizer of G in Homeo + (R) z is bigger 
than the center Z. 

Let G — > Homeo + (R) z be some representation as above, and let S be a finite 
generating set for G (note that G is finitely generated if and only if G is). Let n 
be the uniform probability measure on S, and let /j, n be the n-fold convolution of 
[i. That is, [i n is the probability measure associated to a random walk of length 
n on G in the generators S. Note that we do not require S to be symmetric or to 
generate G as a group. 

Theorem 6.1 (Central limit theorem). Let G be a subgroup of Homeo + (R) z , and 
let S be a finite generating set. Let <7o, <?i, • " " be a Markov process on G, where go = 
id 7 and where each g n is obtained from g n -\ by right multiplication by a random 
element s e S (in the uniform measure). Then there is a central limit theorem 
for rot; i.e there are constants E and a so that n _1 / 2 (rot(<7 n ) — En) converges in 
probability to the Gaussian measure N(0,a). 

Proof. Let G be the image of G in Homeo + (S' 1 ), and let S be the image of S. 
Further, let F be the semigroup generated by S (which might be smaller than G). 
Let M(S 1 ) denote the space of probability measures on S 1 , and let As : M(S 1 ) — > 
MiS 1 ) be defined by 

A fixed point of As is called a stationary (or harmonic) measure on S 1 . We let v be 
an ergodic stationary measure. By construction, the action of F on S 1 is absolutely 
continuous with respect to v. 

If v contains atoms, then there is an atom of biggest measure, supported at some 
point p. Since v{p) is equal to the average of v{sp), it follows that the measure of 
every v{sp) is equal to the measure of v(p). It follows that every point in Fp has 
the same (atomic) measure, and therefore this set is finite and ^-invariant. Since 
the action is by homeomorphisms, this set is invariant under G, and therefore G 
preserves a probability measure on S . In this case it is well-known that rot is a 
homomorphism from G to R, and the ordinary central limit theorem applies. So 
we assume in the sequel that v contains no atoms. 

Since v is stationary, for any measurable / C S 1 ) (in particular, for every interval 
I), there is an equality 

j^lW)) =!/(/) 
^ ses 

Let V be the Radon measure on R obtained by identifying R locally with S 1 . Ob- 
serve that V is invariant under integer translation, and furthermore it satisfies 
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V([t,t + 1]) = 1 for all tel. Furthermore, V is evidently stationary for S; i.e. 
jk] S«es ^( s (-0) = "(I) for all measurable IcR. 

Define F : R -> R as follows. For each t choose T < and T«t, and define 

F{t)=V[T,t] -V[T,0] 

(evidently, F does not depend on the choice of sufficiently negative T). For any 
tel. define 



jl£jr( a (i))^ -F(t) 



/(*) 

If u is arbitrary, and / is the interval with extremal points t and u, then 

/(«) = /(*) + (^£w))) = /(*) 



Hence /(•) is constant, and equal to some fixed E, which we call the drift of S. 

It follows from this that the function F(g n (0)) — nE is a (bounded) martingale; 
that is, the expected value of F(g n (0)) ~ nE given g n -\ is F(</„_i(0)) — (n — 1)E. 

Now, if Xi is any martingale with bounded increments, if a 2 is the expectation of 
(Xi + i — Xi) 2 given X\, X2, • • • , Xi, and if t„ is the minimum n so that J27=i °f — 
n, then the Martingale central limit theorem (see e.g. [28], especially Thm. 3.2. 
on page 58) says that X Trl /y^n converges in probability to a normal distribution 
iV(0, 1). In our particular case, it turns out that T n /n converges in probability to 
cr 2 for some constant a. For, the expectation of (F(g n ) — F(g n -i) — E) 2 given g n -i 
depends only on <? n _i(0) mod Z, and since by hypothesis the measure v is ergodic 
for the action of As, the random ergodic theorem (see [22], Thm. 3.1) implies such 
convergence in probability (even in L 1 ). 

Now, |rot(g) — g(0) \ < 1 for any g e G, and moreover \F(t) — t\<l. Therefore a 
central limit theorem for the function F(g n (0)) — nE implies one for rot((? n ) — nE 
and the theorem is proved. □ 

Remark 6.2. If S = S^ 1 (symmetric random walk in a group), then the random 
process is invariant under taking inverses, and therefore E = 0. Similarly, if S is 
conjugated to itself (in Homeo + (R) z ) by some reflection t — > 2C — t, then E = 
—E = by symmetry. 

Corollary 6.3. Let G be a finitely generated subgroup of Homeo + (S' 1 ) ; and let G 
be the preimage in Homeo + (M) z . Suppose further that scl vanishes identically on 
G. Then if scl„ denotes the value of scl on a random walk in G (in some finite 
symmetric generating set), there is some a for which the following is true: 

lim P(a < scln/a^/n < b) = / lm oo) e_a: ^ 2 dx 

n->oo 27T J a 

Proof. For any group G there is a short exact sequence 

-> H\G) -> Q(G) -> Hl{G) -> i? 2 (G) 

where i? 2 denotes bounded cohomology. Since scl vanishes identically on G by 
hypothesis, Q(G)/H 1 (G) = by Bavard duality; i.e. Theorem 2.6. However, the 
coboundary 8vot exists as an element of H 2 {G) whose image in H 2 (G) is nontrivial, 
and equal to the familiar Euler class. 
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Since G is the central extension associated to the Euler class, it follows that 
Q(G)/H 1 (G) is one dimensional, and spanned by rot. So scl(g) = |rot(g)|/2D(rot) 
on the commutator subgroup of G, and the conclusion follows. □ 

Example 6.4. Corollary 6.3 applies to many naturally occurring families of groups, 
including Hilbert modular groups SL(2,0(n)) where 0(n) is the ring of integers 
in Q(v^j) for n square-free, SL(2, Z[l/2]), Thompson's circle group F and certain 
generalized Stein-Thompson groups, and many others. The fact that scl vanishes 
identically on these groups follows from the stronger property that they are bound- 
edly generated by commutators. 

For Hilbert modular groups, this is a consequence of a deep theorem of Carter- 
Keller-Paige, namely [46] Thm. 6.1 which says that if A is the ring of integers in a 
number field K containing infinitely many units, and T is an element of SL(2, A) 
which is not a scalar matrix, then SL(2,A) has a finite index normal subgroup 
which is boundedly generated by conjugates of T. 

The case SL(2, Z[l/2]) is due to Liehl [36] who proves that the group is boundedly 
generated by elementary matrices (which are themselves products of commutators 
of bounded length) . 

The case of Thompson's group is due to Ghys-Sergiescu [24], and some gener- 
alizations are due to Zhuang [48]. For an introduction to Thompson's groups and 
their properties, see [14]. 

6.2. Random turtles in the hyperbolic plane. In this subsection we give a 
geometric application of Theorem 6.1 of independent interest. Consider the fol- 
lowing random process. A turtle starts at the origin in the hyperbolic plane, and 
moves by alternately moving forward some fixed distance £, and by turning cither 
left or right through some fixed angle a. Let Po,Pi, • • ■ denote the locations of the 
turtle after each successive move forward. Note p is the origin, and d(pi,p i+ i) = I 
for each i. We think of the Pi as the vertices of a random polygonal path. 

For each n, let P n be the polygon with n+1 (cyclic) vertices po, p\ , • • • ,p n . There 
are (at least) two natural geometric quantities to associate to P n . If 7 : S 1 — > R 2 is a 
C 1 immersion, the winding numberis the degree of the Gauss map 9 — > j'(9)/\'y'(9)\. 
Moreover, the algebraic area enclosed by 7 is the integral J R2 wind(7,p)darea(p), 
where the local winding number wind(7,p) is the degree of the map 9 — > (7(6*) — 

p)/M0)-p)\. 

Although dP n is only piecewise linear, it can be smoothed canonically by rotating 
the tangent vector left or right through an angle a at each vertex pi with < i < n 
(according to the behavior of the turtle), and then in an arbitrary way at po and p n . 
Thus we can assign to the sequence p n two geometric quantities W n , the winding 
number of dP n , and A n , the algebraic area enclosed by dP n . 

Let oti be the signed turning angle of P n at the vertex p t . Then the Gauss-Bonnet 
theorem for immersed polygons says there is an equality 

2ttVF„ - a i = - A n 

i 

Theorem 6.5 (Area and Winding Theorem). Fix some angle a and length £. Let 
P n be a random polygon in the hyperbolic plane with (cyclic) vertices Po,Pi, ■ ■ ■ ,p n , 
where d(pi,pi+i) = £ for each < i < n — 1 and an angle of ±a at each pi with 
< i < n, with signs independently and uniformly chosen from ±1. Let A n be 
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(as above) the algebraic area enclosed by P n , and W„ the winding number of dP n - 
Then A n and W n both satisfy a central limit theorem with mean 0. 

Proof. Let L and R be the hyperbolic isometries which translate the origin a 
distance £ and then rotate either left or right through angle a. As matrices in 
PSL(2,K), we can take 

/ e £ / 2 cos(a/2) e^ 2 sin(a/2)\ /e^/ 2 cos(a/2) -e £ / 2 sin(a/2)\ 

\-e- £ / 2 sm(a/2) e^ 2 cos(a/2)y {e^/ 2 sin(a/2) e^/ 2 cos(a/2)y 

The successive Pi are obtained from p by right multiplication by a random 
sequence of R's and L's. We think of Pi = Wip where Wi is an element of the free 
semigroup generated by R and L. By exponentiation, the unit tangent circle at each 
point in H 2 is canonically identified with the ideal circle (this defines a canonical 
flat projective connection on the unit tangent bundle). Therefore, the winding 
number can be computed from the (lifted) action on the ideal circle. We think of 
R and L as elements of Homeo + (K) z , normalized to move some basepoint distance 
< 1 in the positive and negative directions respectively. Then |Wj — rot(u>j)| < 1, 
so Theorem 6.1 shows that W n satisfies a central limit theorem. Note that E = 0, 
by the left-right symmetry (see Remark 6.2). 

With notation as in the proof of Theorem 6.1, the central limit theorem for 
W n follows from the martingale property of the function F(w n (0)). However, the 
function F(w n (0)) — X)™=i ai /2 7r i s a l so a martingale, so the same proof gives a 
central limit theorem for A n , as claimed. □ 

Remark 6.6. If a is small enough compared to £, the polygonal path p$,pi,p2, • • • is 
uniformly quasigeodesic. In this case, the areas of successive triangles Po,p n ,Pn+i 
define a Holder continuous function on the one-sided shift space on the alpha- 
bet {L, R}. The Holder continuity amounts to the observation that for four points 
a, b, c, d in H 2 with d(a, b) — d(c, d) = £, the difference of area(o, c, d) and area(6, c, d) 
is bounded by a constant that decays exponentially fast in the distance from b to 
c. The central limit theorem for Holder functions on shift spaces gives a different 
proof in this case. This kind of argument is implicit in [40] and a related argument 
is pursued in [11, 31]. 

Remark 6.7. There is nothing very special (apart from its charm) about the par- 
ticular random model we chose for the polygons P„. We could just as easily fix 
some finite subset S C PSL(2,M) and define a random sequence pi by pi — spi-i 
for some random seS. The winding number and algebraic area of P n satisfy a 
central limit theorem in this case too, and with essentially the same proof. Note in 
this generality, the drift might be nonzero. 

Remark 6.8. It is interesting to study how the statistical quantities associated to A n 
and W n vary as a function of the parameters. Let's fix a and let £ vary, and consider 
the random winding number W n . By the left-right symmetry, the drift E{£) is zero 
for all £. But the standard deviation a(£) undergoes a phase transition: it is zero for 
£ e [2cosh _1 (l/ sin(a/2)), oo), and increases monotonically to a as I decreases to 
0. In this case a(£) is real analytic (as a function of £) on [0, 2 cosh _1 (l/ sin(a/2))). 

6.3. Central limit theorem for arbitrary quasimorphisms. In fact, very 
shortly after proving Theorem 6.1, we learned that a completely general state- 
ment has independently been obtained by Bjorklund-Hartnick [5]. They prove the 
following theorem (in fact, their results hold in considerably greater generality): 
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Theorem 6.9 (Bjorklund Hartnick [5], Thm. 1.1). Let G be a finitely generated 
group, and S a finite generating set. Let f : G — > WL be a quasimorphism, and 
X n := s n ■ ■ ■ si an i.i.d. left-random walk on G in the generating set S. Then f 
satisfies a central limit theorem with respect to X n . Moreover, if the homogenization 
of f is nonzero, then the central limit is non-degenerate. 

By Bavard Duality we immediately conclude the following: 

Corollary 6.10. Let G be any finitely generated group, and suppose Hx{G) is finite. 
Let S be a finite symmetric generating set. Suppose Q(G) is finite dimensional but 
nonzero. Let g n be obtained by random walk of length n with respect to the uniform, 
measure on S. Then for any e there are positive constants a, b depending on e such 
that P(a < scl(g n )/y/n < b) > 1 - e for n > 0. 

Theorem 6.9 generalizes Theorem 6.1 considerably, but the proof turns out not 
to be too much harder. Under very general conditions, Bjorklund-Hartnick show 
that a quasimorphism / on a group G with a probability measure /i (satisfying 
some conditions) has a (bi-)harmonic representative; i.e. there is a function /' that 
differs from / by a bounded amount, and with the property that /' is invariant 
under convolution with [i (the existence of such a harmonic representative was 
also proved in quite a different way by Burger-Monod [9]). It is this harmonic 
feature of /' that lets one prove a CLT using the martingale CLT, as in the proof 
of Theorem 6.1. 

Remark 6.11. We would like to point out that in the first version of their pa- 
per, Bjorklund-Hartnick required that the probability measure on S be symmetric, 
whereas we never required this as a hypothesis. 

Remark 6.12. Burger Monod and Bjorklund-Hartnick both work with individual 
quasimorphisms. A finite collection of quasimorphisms <j>\, <j>2, ■ ■ ■ , <ft m can be put 
together into a single function $ : G — > R m whose coordinates are the (j>i- It makes 
sense to ask for some harmonic representative with |<£>' — $| < oo and a CLT for 
<f>' (and therefore also for <f>). In fact, the arguments in [9, 5] easily generalize to 
this situation. The existence of a harmonic representative is essentially elementary. 
The quasimorphism property implies by definition that there is a constant C so 
that for any g, h we have \l/2($(gh) + ^(gh^ 1 )) - <f>(g)| < C. It follows that for 
symmetric, the n-fold convolutions & n (g) '■= J G ^{gh)dfi* n (h) lie in a precompact 
family, and therefore we obtain in the usual way a fixed point for convolution with 
fi; i.e. a harmonic representative. Following Bjorklund-Hartnick we deduce a CLT 
for $. 

The condition in Corollary 6.10 that H\(G) is finite is an annoying technical 
restriction. The problem is that one cannot perform the "homological correction 
trick" that we used earlier for two reasons. Firstly, there is no compression of 
the support of a Gaussian measure, and therefore conditioning on a rare event can 
change the order of magnitude of the distribution. Secondly, the order of magnitude 
of the error term (i.e. the size of the image in homology) is the same as the term we 
are trying to control, so it is impossible to naively obtain lower bounds, since the 
two terms might cancel. Nevertheless, the following conjectures seem reasonable. 
The first just asserts that the hypothesis that Hi(G) is finite in Corollary 6.10 can 
be removed: 



STATISTICS AND COMPRESSION OF SCL 



41 



Conjecture 6.13 (Local limit conjecture). Let G be any finitely generated group. 
Let S be a finite symmetric generating set. Suppose Q{G) is finite dimensional 
but nonzero. Let g n be obtained by random walk of length n with respect to the 
uniform measure on S, conditioned to lie in [G, G] . Then for any e there are 
positive constants a, b depending on e such that P(a < scl(g n ) / y/n < b) > 1 — e for 
n > 0. 

The second conjecture is more ambitious, and asserts that the 0(y/n) growth 
rate should characterize (finitely presented) groups with Q(G) finite dimensional: 

Conjecture 6.14 (Finite dimensionality). Let G be a finitely presented group, and 
let g n be obtained by random walk of length n conditioned to lie in [G, G\. Suppose 
that for any e there is a positive constant b so that 

P(scl( ffn )/v^ < b) > 1 - e 

for n 0. Then Q(G) is finite dimensional. 

Finally, we ask for finitely presented groups for which scl has intermediate growth: 

Question 6.15. Is there a finitely presented group G so that if g n is obtained by 
random walk of length n conditioned to lie in [G, G], then for any C > and e > 0, 
there is an estimate 

P(scl(g„) < C\/n or scl(g„) > C _1 n/ log n) < e 

forn^>0? 

We believe that it should be possible to produce finitely generated groups with 
the property sought by Question 6.15, by a careful small cancellation argument. 
It would also be very interesting to find examples of (uncountable) transformation 
groups G containing finitely generated subgroups V so that the growth rate of scIg 
on random walk in T is intermediate between y/n and n/logn. 

Question 6.16. Let £ be a closed surface with x(S) < 0, equipped with a smooth 
area form. Let G be the group of Hamiltonian diffeomorphisms of S. Is there a 
well-defined growth rate of scIq on random walk in T for T a "generic" finitely 
generated subgroup of G? Does it grow like n/logn? 
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